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Preface 


The NASA Office of Aeronautics and Space Technology (OAST) has established the 
goal of providing a technology base so that NASA can accomplish future missions with 
a several-orders-of-magnitude increase in mission effectiveness at reduced cost. To realize 
this goal, a highly focused program must be established advancing technologies that 
promise substantial increases in capability and/or substantial cost savings. The Study 
Group on Machine Intelligence and Robotics was established to assist NASA technology 
program planners to determine the potential in these areas. Thus, the Study Group had 
the following objectives: 

(1) To identify opportunities for the application of machine 
intelligence and robotics in NASA missions and systems. 

(2) To estimate the benefits of successful adoption of machine 
intelligence and robotics techniques and to prepare forecasts 
of their growth potential. 

(3) To recommend program options for research, advanced devel- 
opment, and implementation of machine intelligence and 
robot technology for use in program planning. 

(4) To broaden communication among NASA centers and uni- 
versities and other research organizations currently engaged in 
machine intelligence and robotics research. 
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Foreword 


This publication, complete with appendant documentation, is the final report of the 
NASA Study Group on Machine Intelligence and Robotics. As you will note in the 
Introduction, Section I, the report tells why the Study Group was gathered together* and 
what the Group felt and hoped to do. You will see that Section II is a timely tutorial on 
machine intelligence and robotics inasmuch as both fields may be really neoteric to a lot 
of assiduous readers. 

NASA’s needs and the applications of machine intelligence and robotics in the space 
program are discussed for you in Sections III and IV. Section V discusses the generic 
topic, Technological Opportunities, in two subsections. A, Trends in Technology, B, 
Relevant Technologies, and a third subsection, which is an Appendix on Relevant Tech- 
nologies. (Don’t skip any of these subsections, especially the third, because if you look 
there, you will find detailed discussions of the conclusions and recommendations which 
the Group made on each specific machine intelligence and robotics subject or topic.) 

After 25 hundred man-hours, the Study Group and the workshop participants arrived 
at a few prenotions concerning the state of the art situation as it exists in NASA with 
regard to the machine intelligence and the robotics fields. The study members and work- 
shop participants then conclude that four things may be better in NASA if four recom- 
mended items are adopted-as they so wrote in Section VI. 

Appendix A tells who the Study Group people are, their organizations, interests, 
backgrounds, and some accomplishments. The appendix itemizes what the workshop 
subjects or topics were; and where and when the study actions were done at five locations 
in the United States. The people-participants (and what they talked about) are also listed 
for you in Appendix A. Appendixes B (Minsky, 1961), C (Newell, 1969), D (Nilsson 
1974), E (Feigenbaum, 1978), and F (Newell, 1970) are those references which the 
Group feels will provide support for their conclusions and recommendations. 

The Study Group hopes you will read-and that you will find the report valuable and 
useful for the 1980s. 


Carl Sagan, Chairman 
Raj Reddy, Vice Chairman 
Ewald Heer, Executive Secretary 
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Section I 
Introduction 


The NASA Study Group on Machine Intelligence and 
Robotics, composed of many of the leading researchers from 
almost all of the leading research groups in the fields of 
artificial intelligence, computer science, and autonomous 
systems in the United States, met to study the influence of 
these subjects on the full range of NASA activities and to 
make recommendations on how these subjects might in the 
future assist NASA in its mission. The Study Group, chaired 
by Carl Sagan, was organized by Ewald Heer, JPL, at the 
request of Stanley Sadin of NASA Headquarters. It included 
NASA personnel, scientists who have worked on previous 
NASA missions, and experts on computer science who had 
little or no prior contact with NASA. The Group devoted 
about 2500 man-hours to this study, meeting as a full working 
group or as subcommittees between June 1977 and December 
1978. 

A number of NASA Centers and facilities were visited 
during the study. In all cases, vigorous support was offered for 
accelerated development and use of machine intelligence in 
NASA systems, with particularly firm backing offered by the 
Director of the Johnson Space Center, which the Group consid- 


ered especially significant because of JSC’s central role in the 
development of manned spaceflight. 

This report is the complete report of the Study Group. It 
includes the conclusions and recommendations with support- 
ing documentation. The conclusions represent a group con- 
sensus, although occasionally there were dissenting opinions 
on individual conclusions or recommendations. While the 
report is critical of past NASA efforts in this field — and most 
often of the lack of such efforts - the criticisms are intended 
only as constructive. The problem is government-wide, as the 
Federal Data Processing Reorganization Project 1 has stressed, 
and NASA has probably been one of the least recalcitrant 
Federal agencies in accommodating to this new technology. 

The Study Group believes that the effective utilization of 
existing opportunities in computer science, machine intelli- 
gence, and robotics, and their applications to NASA-specific 
problems will enhance significantly the cost-effectiveness and 
total information return from future NASA activities. 


*U.S. Office of Management and Budget, Federal Data Processing 
Reorganization Study. Available from National Technical Information 
Service, Washington, D.C. 


Section II 

Machine Intelligence: An Introductory Tutorial 2 


Many human mental activities, such as writing computer 
programs, doing mathematics, engaging in common sense 
reasoning, understanding language, and even driving an auto- 
mobile, are said to demand “intelligence.” Over the past few 
decades, several computer systems have been built that can 
perform tasks such as these. Specifically, there are computer 
systems that can diagnose diseases, plan the synthesis of 


2 This section is based on copyrighted material in Nils J. Nilsson’s book 

Principles of Artificial Intelligence available from Tioga Publishing 
Company, Palo Alto, California. The Study Group wishes to thank 
Nilsson for his permission for use of this material. 


complex organic chemical compounds, solve differential equa- 
tions in symbolic form, analyze electronic circuits, understand 
limited amounts of human speech and natural language text, 
and write small computer programs to meet formal specifica- 
tions. We might say that such systems possess some degree of 
artificial intelligence. 

Most of the work on building these kinds of systems has 
taken place in the field called Artificial Intelligence (AI) 3 . This 


3 In this report the terms Machine Intelligence and Artificial Intelligence 
are used synonymously. 
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work has had largely an empirical and engineering orientation. 
Drawing from a loosely structured but growing body of 
computational techniques, Ai systems are developed, undergo 
experimentation, and are improved. This process has produced 
and refined several general AI principles of wide applicability. 

AI has also embraced the larger scientific goal of construct- 
ing an information-processing theory of intelligence. If such a 
science of intelligence could be developed, it could guide the 
design of intelligent machines as well as explicate intelligent 
behavior as it occurs in humans and other animals. Since the 
development of such a general theory is still very much a goal 
rather than an accomplishment of AI, we limit our attention 
here to those principles that are relevant to the engineering 
goal of building intelligent machines. Even with this more 
limited outlook, discussion of AI ideas might well be of 
interest to cognitive psychologists and others attempting to 
understand natural intelligence. 

In the rest of this section, we will provide a broad overview 
of several different problem areas in which AI methods and 
techniques have been applied. 

A. Robotics 

The problem of controlling the physical actions of a mobile 
robot might not seem to require much intelligence. Even small 
children are able to navigate successfully through their 
environment and to manipulate items, such as light switches, 
toy blocks, eating utensils, etc. However these same tasks,’ 
performed almost unconsciously by humans, when performed 
by a machine require many of the same abilities used in solving 
more intellectually demanding problems. 

Research on robots or robotics has helped to develop many 
AI ideas. It has led to several techniques for modeling world 
states and for describing the process of change from one world 
state to another. It has led to a better understanding of how to 
generate plans for action sequences and how to monitor the 
execution of these plans. Complex robot control problems 
have forced us to develop methods for planning first at a high 
level of abstraction, ignoring details, and then at lower and 
lower levels, where details become important. Nilsson’s book 
contains several examples of robot problem solving which 
illustrate important ideas in AI. 

B. Perception Problems 

Attempts have been made to fit computer systems with 
television inputs to enable them to “see” their surroundings or 
to fit them with microphone inputs to enable them to “hear” 


speaking voices. From these experiments, it has been learned 
that useful processing of complex input data requires “under- 
standing and that understanding requires a large base of 
knowledge about the things being perceived. 

The process of perception studied in artificial intelligence - 
usually involves a set of operations. A visual scene, say, is 
encoded by sensors and represented as a matrix of intensity 
values. These are processed by detectors that search “for 
primitive picture components such as line segments, simple 
curves, comers, etc. These, in turn, are processed to infer 
information about the three-dimensional character of the 
scene in terms of its surfaces and shapes. The ultimate goal is 
to represent the scene by some appropriate model. This model 
might consist of a high-level description such as “A hill with a 
tree on top with cattle grazing.” 

The point of the whole perception process is to produce a 
condensed representation to substitute for the unmanageably 
immense, raw input data. Obviously, the nature and quality of 
the final representation depend on the goals of the perceiving 
system. If colors are important, they must be noticed; if 
spatial relationships and measurements are important, ttey 
must be judged accurately. Different systems have different 
goals, but all must reduce the tremendous amount of sensory 
data at the input to a manageable and meaningful description. 

The main difficulty in perceiving a scene is the enormous 
number of possible candidate descriptions in which the system' 
might be interested. If it were not for this fact, one could 
conceivably build a number of detectors to decide the 
category of a scene. The scene’s category could then serve as 
its description. For example, perhaps a detector could be built 
that could test a scene to see if it belonged to the category “A 
hiU with a tree on top with cattle grazing.” But why should 
this detector be selected instead of the countless others that 
might have been used? 

The strategy of making hypotheses about various levels of 
description and then testing these hypotheses seems to offer 
an approach to this problem. Systems have been constructed 
that process suitable representations of a scene to develop . 
hypotheses about the components of a description. These, 
hypotheses are then tested by detectors that are specialized to 
the component descriptions. The outcomes of these tests, in 
turn, are used to develop better hypotheses, etc. 

This hypothesize-and-test paradigm is applied at many 
levels of the perception process. Several aligned segments 
suggest a straight line; a line detector can be employed to test 
it. Adjacent rectangles suggest the faces of a solid prismatic 
object; an object detector can be employed to test it. 
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The process of hypothesis formation requires a large 
amount of knowledge about the expected scenes. Some 
Artificial Intelligence researchers have suggested that this 
knowledge be organized in a special structure called a frame or 
schema. For example, when a robot enters a room through a 
doorway, it activates a room schema, which loads into a 
working memory a number of expectations about what might 
be seen next. Suppose the robot perceives a rectangular form. 
This form, in the context of a room schema, might suggest a 
window. The window schema might contain the knowledge 
that windows typically do not touch the floor. A special 
detector, applied to the scene, confirms this expectation, thus 
raising confidence in the window hypothesis. Nilsson’s book 
discusses various fundamental ideas underlying frame- 
structured representations and inference processes. 


C. Combinatorial and Scheduling 
Problems 

An interesting class of problems is concerned with specify- 
ing optimal schedules or combinations. Many of these prob- 
lems can be attacked by the methods of AI. A classical 
example is the traveling salesman’s problem. .The problem here 
is to find a minimum distance tour, starting at one of several 
cities, visiting each city precisely once, and returning to the 
starting city. The problem generalizes to one of finding a 
minimum cost path over the edges of a graph containing n 
nodes such that the path visits each of the n nodes precisely 
once. 

Many puzzles have this same general character. Another 
example is the eight-queens problem. The problem is to place 
eight queens on a standard chessboard in such a way that no 
queen can capture any of the others; that is, there can be no 
more than one queen in any row, column, or diagonal. In most 
problems of this type, the domain of possible combinations or 
sequences from which to choose an answer is very large. 
Routine attempts at solving these types of problems soon 
generate a combinatorial explosion of possibilities that exhaust 
even the capacities of large computers. 

Several of these problems (including the traveling salesman 
problem) are members of a class that computational theorists 
call NP-complete. Computational theorists rank the difficulty 
of various problems on how the worst case for the time taken 
(or number of steps taken) to solve them by the theoretically 
best method grows with some measure of the problem size. 
(For example, the number of cities would be a measure of the 
size of a traveling salesman problem.) Thus, problem difficulty 
may grow linearly, polynomial^, or exponentially, for exam- 
pie, with problem size. 


The time taken by the best methods currently known for 
solving NP-complete problems grows exponentially with prob- 
lem size. It is not yet known whether faster methods 
(involving only polynomial time, say) exist; but it has been 
proven that if a faster method exists for one of the 
NP-complete problems, then this method can be converted to 
similarly faster methods for all the rest of the NP-complete 
problems. In the meantime, we must make do with expo- 
nential-time methods. 

AI researchers have worked on methods for solving several 
types of combinatorial problems. Their efforts have been 
directed at making the time-versus-problem-size curve grow as 
slowly as possible, even when it must grow exponentially. 
Several methods have been developed for delaying and 
moderating the inevitable combinatorial explosion. Again, 
knowledge about the problem domain is the key to more 
efficient solution methods. Many of the methods developed to 
deal with combinatorial problems are also useful on other, less 
combinatorially severe problems. 


D. Automatic Programming 

The task of writing a computer program is also related to 
other areas of AI. Much of the basis research in automatic 
programming, theorem proving, and robot problem solving 
overlaps. In a sense, existing compilers already do “automatic 
programming.” They take in a complete source code specifica- 
tion of what a program is to accomplish; they write an object 
code program to do it. What we mean here by automatic 
programming might be described as a “super-compiler” or a 
program that could take in a very high-level description of 
what the program is to accomplish and from it produce a 
program. The high-level description might be a precise state- 
ment in a formal language, such as the predicate calculus, or it 
might be a loose description, say, in English, that would 
require further dialogue between the system and the user in 
order to resolve ambiguities. 

The task of automatically writing a program to achieve a 
stated result is closely related to the task of proving that a 
given program achieves a stated result. The latter is called 
program verification. Many automatic programming systems 
produce a verification of the output program as an added 

benefit. 

One of the important contributions of research in auto- 
matic programming has been the notion of debugging as a 
problem-solving strategy. It has been found that it is often 
much more efficient to produce an errorful solution to a 
programming or robot control problem cheaply and then 
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modify it (to make it work correctly), than to insist on a first 
solution completely free of defects. 


E. Expert Consulting Systems 

AI met hods have also been employed in the development of 
automatic consulting systems. These systems provide human 
users with expert conclusions about specialized subject areas. 
Automatic consulting systems have been built that can 
diagnose diseases, evaluate potential ore deposits, suggest 
structures for complex organic chemicals, and even provide 
advice about how to use other computer systems. 


A key problem in the development of expert consulting 
systems is how to represent and use the knowledge that human 
experts in these subjects obviously possess and use. This 
problem is made more difficult by the fact that the expert 
knowledge in many important fields is often imprecise, 
uncertain, or anecdotal (though human experts use such 
knowledge to arrive at useful conclusions). 


Many expert consulting systems employ the AI technique 
of rule-based deduction. In such systems, expert knowledge is 
represented as a large set of simple rules, and these rules are 
used to guide the dialogue between the system and the user 
and to deduce conclusions. Rule-based deduction is one of the 
major topics in Nilsson’s book. 


F. Natural Language Processing 

When humans communicate with each other using language, 
they employ, almost effortlessly, extremely complex and still 
little understood processes. It has been very difficult to 
develop computer systems capable of generating and “under- 
standing” even fragments of a natural language, such as 
English. One source of the difficulty is that language has 
evolved as a communication medium between intelligent 
beings. Its primary purpose is to transmit a bit of “mental 
structure” from one brain to another under circumstances in 
which each brain possesses large, highly similar surrounding 
mental structures that serve as a common context. Further- 
more, part of these similar, contextual mental structures 
allows each brain to know that the other brain also possesses 
this common structure and that the other brain can and will 
perform certain processes using it during its communication 
acts.” The evolution of language use has apparently exploited 
the opportunity for each brain to use its considerable 
computational resources and shared knowledge to generate 
and understand highly condensed and streamlined messages- A 
word to the wise from the wise is sufficient. Thus, generating 


and understanding language is an encoding and decoding 
problem of fantastic complexity. 

A computer system capable of understanding a message in 
natural language would seem, then, to require (no less thaa 
would a human) both the contextual knowledge and the 
processes for making the inferences (from this contextual 
knowledge and from the message) assumed by the message 
generator. Some progress has been made toward compute'r 
systems of this sort, for understanding spoken and written 
fragments of language. Fundamental to the development of 
such systems are certain AI ideas about structures for 
representing contextual knowledge and certain techniques for 
making inferences from that knowledge. Although Nilsson’s 
book does not treat the language-processing problem in detail, 
it does describe some important methods for knowledge 
representation and processing that do find application in 
language-processing systems. 


G. Intelligent Retrieval 
From Databases 

Database systems are computer systems that store a large 
body of facts about some subject in such a way that they can 
be used to answer user’s questions about that subject. To take ' 
a specific example, suppose the facts are the personnel records 
of a large corporation. Example items in such a database might 
be representations for such facts as “Joe Smith works in the 
Purchasing Department,” “Joe Smith was hired on October 8, 
1976,” ‘The Purchasing Department has 17 employees,”’ 
John Jones is the manager of the Purchasing Department,” 

The design of database systems is an active subspecialty of 
computer science, and many techniques have been developed 
to enable the efficient representation, storage, and retrieval of 
large numbers of facts. From our point of view, the subject 
becomes interesting when we want to retrieve answers that 
require deductive reasoning with the facts in the database. 

There are several problems that confront the designer of 
such an intelligent information retrieval system. First, there is 
the immense problem of building a system that can understand 
queries stated in a natural language like English. Second, even 
if the language-understanding problem is dodged by specifying 
some formal, machine-understandable query language, the 
problem remains of how to deduce answers from stored facts. 
Third, understanding the query and deducing an answer may 
require knowledge beyond that explicitly represented in the 
subject domain database. Common knowledge (typically omit- 
ted in the subject domain database) is often required. For 
example, from the personnel facts mentioned above, an 
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intelligent system ought to be able to deduce the answer 
“John Jones” to the query “Who is Joe Smith’s boss?” Such a 
system would have to know somehow that the manager of a 
department is the boss of the people who work in that 
department. How common knowledge should be represented 
and used is one of the system design problems that invites the 
methods of Artificial Intelligence. 

H. Theorem Proving 

Finding a proof (or disproof) for a conjectured theorem in 
mathematics can certainly be regarded as an intellectual task. 
Not only does it require the ability to make deductions from 
hypotheses but it also demands intuitive skills such as guessing 
about which lemmas should be proved first in order to help 
prove the main theorem. A skilled mathematician uses what he 
might call judgment (based on a large amount of specialized 
knowledge) to guess accurately about which previously proven 
theorems in a subject area will be useful in the present proof 
and to break his main problem down into subproblems to 
work on independently. Several automatic theorem pioving 
programs have been developed that possess some of these same 
skills to a limited degree. 

The study of theorem proving has been of significant value 
in the development of AI methods. The formalization of the 
deductive process using the language of predicate logic, for 
example, helps us to understand more clearly. some of the 
components of reasoning. Many informal tasks, including 
medical diagnosis and information retrieval, can be formalized 
as theorem-proving problems. For these reasons, theorem 
proving is an extremely important topic in the study of AI 
methods. 

I. Social Impact 4 


1. The Dehumanization - Alienation 
Hypothesis 

There has been a great deal of nervousness, and some 
prophetic gloom, about human work in highly automated 
organizations. An examination of such empirical evidence, and 
an analysis of the arguments that have been advanced for a 
major impact of automation upon the nature of work has led 
us to a largely negative result. 

There is little evidence for the thesis that job satisfaction 
has declined in recent years, or that the alienation of workers 
has increased. Hence, such trends, being nonexistent, cannot 
be attributed to automation, past or prospective. Trends 
toward lower trust in government and other social institutions 
flow from quite different causes. 

An examination of the actual changes that have taken place 
in clerical jobs as the result of introducing computers indicates 
that these changes have been modest in magnitude and mixed 
in direction. The surest consequence of factory and office 
automation is that it is shifting the composition of the labor 
force away from those occupations in which average job 
satisfaction has been lowest, toward occupations in which it 
has been higher. 

The argument that organizations are becoming more au- 
thoritarian and are stifling human creativity flies in the face of 
long-term trends in our society toward the weakening of 
authority relations. Moreover, the psychological premises on 
which the argument rests are suspect. Far more plausible is the 
thesis that human beings perform best, most creatively, and 
with greatest comfort in environments that provide them with 
some immediate amount of structure, including the structure 
that derives from involvement in authority relations. Just 
where the golden mean lies is hard to say, but there is no 
evidence that we are drifting farther from it. 


The impact of computers, machine intelligence, and robot- 
ics must be examined in the broader context of their impact 
on society as a whole, rather than the narrower focus based on 
NASA needs and applications. The impact of information 
processing technology (and machine intelligence and robotics) 
on society has been considered in detail by Simon. Here we 
present the conclusions derived by him. The reader is referred 
to Simon’s book for details of the reasoning and evidence that 
led to the conclusions presented here. 


Finally, while we certainly live in a world that is subject to 
continuing change, there is reason to believe that the changes 
we are undergoing are psychologically no more stressful, and 
perhaps even less, stressful, than those that our parents and 
grandparents experienced. It appears that the human conse- 
quences we may expect from factory and office automation 
are relatively modest in magnitude, that they will come about 
gradually, and that they will bring us both disadvantages and 
advantages - with the latter possibly outweighing the former. 


4 This subsection is based on material presented in The New Science of 
Management Decision t revised edition, by Herbert A. Simon, Picntice- 
Hall, Englewood Cliffs, N.J., 1977. The Study Group would like to 
thank Professor Simon and Prentice-Hall for their kind permission tor 
the use of the material. The reader is referred to Chapters 3 and 5 ot 
the book for detailed discussions that lead to the conclusions pre- 
sented here. _____ 


Reproduced from 
best available copy. 


2. The Potential for Increased 
Unemployment 

Simon presents evidence that any level of technology and 
productivity is compatible with any level ol employment, 
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including full employment. He suggests that the problems we 
face today will not cause us to retreat from high technology - 
for such a retreat would not be consistent with meeting the 
needs of the world’s population - but that they will bring 
about a substantial qualitative shift in the nature of our 
continuing technological progress. For future increases in 
human productivity, we will look more to the information- 
processing technologies than to the energy technologies. 
Because of resource limitations and because of shifting 
patterns of demand with rising real incomes, a larger fraction 
of the labor force than at present will be engaged in producing 
services, and a smaller fraction will be engaged in producing 
goods. But there is no reason to believe that we will experience 
satiety of either goods or services at full employment levels. 

3. The Impact on Resources and 
Environment 

Technology is knowledge and information-processing tech- 
nology is knowledge of how to produce and use knowledge 
more effectively. Modern instruments - those, for example, 
that allow us to detect trace quantities of contaminants in air, 
water, and food- inform us about consequences of our 
actions of which we were previously ignorant. Computers 
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applied to the modeling of our energy and environmental 
systems trace out for us the indirect effects of actions taken in 
one part of our society upon other parts. Information- 
processing technology is causing all of us to take account of * 
the consequences of our actions over spans of time and space ~ 
that seldom concerned us in the past. It is placing on us - 
perhaps forcing on us - the responsibilities of protecting . 
future generations as well as our own. In this way, the new 
technology, the new knowledge, is helping to redefine the 
requirements of morality in human affairs. 


J. Conclusion 

In this section we have attempted to provide a broad 
introductory tutorial to AI. Detailed discussion of the meth- 
ods and techniques of AI and the wide range of problem 
domains in which they have been applied is given in various 
survey articles by Minsky (1963), Newell (1969), Nilsson 
(1974), and Feigenbaum (1978) all of which appear as 
Appendixes B to E of this report. Appendix F (Newell, 1970) 
discusses the relationship between artificial intelligence and* 
cognitive psychology. (The book. Introduction to Artificial 
Intelligence by Patrick H. Winston, also provides an excellent 
introduction to the field.) 
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Section III 
NASA Needs 


NASA is, to a significant degree, an agency devoted to the 
acquisition, processing, and analysis of information — about 
the Earth, the solar system, the stars, and the universe. The 
principal goal of NASA’s booster and space vehicle commit- 
ment is to acquire such scientific information for the benefit 
of the human species. As the years have passed and NASA has 
mustered an impressive array of successful missions, the com- 
plexity of each mission has increased as the instrumentation 
and scientific objectives have become more sophisticated, and 
the amount of data returned has also increased dramatically. 
The Mariner 4 mission to Mars in 1965 was considered a strik- 
ing success when it returned a few million bits of information. 
The Viking mission to Mars, launched a decade later, acquired 
almost ten thousand times more information. Comparable 
advances have been made in Earth resources and meteoro- 
logical satellites, and across the full range of NASA activities. 
At the present time, the amount of data made available by 
NASA missions is larger than scientists can comfortably sift 
through. This is true, for example, of Landsat and other Earth 
resources technology satellite missions. A typical information 
acquisition rate in the 1980s is about 10 12 bits per day for all 
NASA systems. In two years, this is roughly the total non- 
pictorial information content of the Library of Congress. The 
problem is clearly getting much worse. We have reached a 
severe limitation in the traditional way of acquiring and 
analyzing data. 

A recent study at JPL estimates that NASA could save 
1.5 billion dollars per year by A.D. 2000 through serious 
implementation of machine intelligence. Given different 
assumptions, the saving might be several times less or several 
times more. It is clear, however, that the efficiency of NASA 
activities in bits of information per dollar and in new data 
acquisition opportunities would be very high were NASA to 
utilize the full range of modern computer science in its mis- 
sions. Because of the enormous current and expected advances 
in machine intelligence and computer science, it seems possible 
that NASA could achieve orders-of-magnitude improvement in 
mission effectiveness at reduced cost by the 1990s. 

Modern computer systems, if appropriately adapted, are 
expected to be fully capable of extracting relevant data either 
onboard the spacecraft or on the ground in user-compatible 
format. Thus, the desired output might be a direct graphic 
display of snow cover, or crop health, or global albedo, or 
mineral resources, or storm system development, or hydro- 
logic cycle. With machine intelligence and modern computer 


graphics, an immense amount of data can be analyzed and 
reduced to present the scientific or technological results 
directly in a convenient form. This sort of data-winnowing 
and content analysis is becoming possible, using the develop- 
ing techniques of machine intelligence. But it is likely to 
remain unavailable unless considerably more relevant research 
and systems development is undertaken by NASA. 

The cost of ground operations of spacecraft missions and 
the number of operations per command uplinked from ground 
to spacecraft are increasing dramatically (Figures 3-1 and 3-2). 
Further development of automation can, at the same time, 
dramatically decrease the operations costs of complex missions 
and dramatically increase the number and kinds of tasks per- 
formed, and therefore, the significance of the data returned. 
Figures 3-3 and 34 illustrate schematically how improved 
automation can produce a significant decline in the cost of 
mission operations. The projected reallocation of responsibility 
during mission operations between ground-based humans and 
spacecraft computer processing is shown in Figure 3-5. There 
are many simple or repetitive tasks which existing machine 
intelligence technology is fully capable of dealing with more 
reliably and less expensively than if human beings were in the 
loop. This, in turn, frees human experts for more difficult 
judgmental tasks. In addition, existing and projected advances 
in robot technology would largely supplant the need for 
manned missions, with a substantial reduction in cost. 



Figure 3-1. Trend of mission ground operations costs. Increasing 
mission complexity and duration contribute to the 
ground operation costs. 




Section IV 

Applications of Machine Intelligence 
and Robotics in the Space Program 5 


A. Introduction 

The space program is at the threshold of a new era that may 
be distinguished by a highly capable space transportation sys- 
tem. In the 1980s, the Space Shuttle and its adjuncts will en- 
able increased activities in the scientific exploration of the 
universe and a broadened approach to global service undertak- 
ings in space. The first steps toward utilizing the space environ- 
ment for industrial and commercial ventures will become 
possible and can trigger requirements for more advanced space 
transportation systems in the 1990s. This will enable expanded 
space industrial activities and, by the end of this century, could 
lead to Satellite Power Systems for solar energy production, 
to lunar or asteroidal bases for extracting and processing 
material resources, and to manned space stations for com- 
mercial processing and manufacturing in space. A major objec- 
tive for NASA is to develop the enabling technology and to 
reduce the costs for operating such large-scale systems during 
the next two decades. On examining potential NASA missions 
in this time frame we expect that machine intelligence and 
robotics technology will be a vital contributor to the cost- 
effective implementation and operation of the required sys- 
tems. In some areas, it will make the system feasible, not only 
for technological reasons, but also in terms of commercial 
acceptability and affordability. 

During the next two decades, the space program will shift 
at least some emphasis from exploration to utilization of the 
space environment. It is expected that this shift will be accom- 
panied by a large increase in requirements for system opera- 
tions in space and on the ground, calling for general-purpose 
automation (robotics) and specialized automation. What 
operations, tasks, and functions must be automated, and to 
what degree, to accomplish the NASA objectives with the 
most cost-effective systems? 

B. Robots and Automation in 
NASA Planning 

Whereas mechanical power provides physical amplification 
and computers provide intellectual amplification, telecom- 
munication provides amplification of the space accessible to 

5 Excerpted from^Vew Luster for Space Robots and Automation by 

Ewald Heer, Astronautics & Aeronautics, Volume 16, No 9, pp 48-60, 

September 1978. 


humans. By means of telecommunication, humans can activate 
and control systems at remote places. They can perform tasks 
even as far away as the planets. During the 1960s, this became 
known as teleoperation. Teleoperators are man -machine 
systems that augment and extend human sensory, manipu- 
lative, and cognitive abilities to remote places. In this context, 
the term robot can then be applied to the remote system of a 
teleoperator, if it has at least some degree of autonomous 
sensing, decision-making, and/or action capability. The con- 
cept of teleoperation has profound significance in the space 
program. Because of the large distances involved, almost all 
space missions fall within the teleoperator definition; and, 
because of the resultant communication delay for many 
missions, the remote system requires autonomous capabilities 
for effective operation. The savings of operations time for 
deep space missions can become tremendous, if the remote 
system is able to accomplish its tasks with minimum ground 
support. For example, it has been estimated that a Mars roving 
vehicle would be operative only 4 percent of the time in a 
so-called move-and-wait mode of operation. With adequate 
robot technology, it should be operative at least 80 percent of 
the time. 

NASA saw the need to examine the civilian role of the U.S. 
space program during the last quarter of this century. A series 
of planning studies and workshops was initiated with the Out- 
look for Space Study in 1974, which included a comprehen- 
sive forecast of space technology for 1980-2000. In a subse- 
quent NASA/OAST Space Theme Workshop, the technology 
forecasts were applied to three broad mission themes, space 
exploration, global services, and space industrialization. Based 
on the derived requirements for cost-effective space mission 
operations, five new directions were identified for develop- 
ments in computer systems, machine intelligence and robotics: 
(1) automated operations aimed at a tenfold reduction in 
mission support costs; (2) precision pointing and control, 
(3) efficient data acquisition to permit a tenfold increase in 
information collection needed for global coverage; (4) real-time 
data management; and (5) low-cost data distribution to allow 
a thousand-fold increase in information availability and 
space-systems effectiveness. The machine intelligence and 
automation technologies for data acquisition, data processing, 
information extraction, and decision making emerge here as 
the major drivers in each area and call for their systematic 
development. In addition, for certain areas such as automated 
operations in space, the mechanical technologies directed at 
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materials and objects acquisition, handling, and assembly 
must also be further developed; robots doing construction 
work in Earth orbit or on the lunar surface will need manipu- 
lative and locomotion devices to perform the necessary trans- 
port and handling operations. 

C. Future Applications 

In space applications, robots may take on many forms. 
None looks like the popular science fiction conception of a 
mechanical man. Their appearance follows strictly functional 
lines, satisfying the requirements of the mission objectives to 
be accomplished. The discussion which follows briefly presents 
mission categories, mission objectives, and system character- 
istics pertinent to space robot and automation technology. 
Estimates of technology development efforts to automate 
system functions are given in Table 4-1.' 

1. Space Exploration 

Space exploration robots may be exploring space from 
Earth orbit as orbiting telescopes, or they may be planetary 
flyby and/or orbiting spacecraft like the Mariner and Pioneer 
families. They may be stationary landers with or without 
manipulators like the Surveyor and the Viking spacecraft, or 
they may be wheeled like the Lunakhod and the proposed 
Mars rovers. Others may be penetrators, flyers, or balloons, 
and some may bring science samples back to Earth (Figures 
4-1 - 4-3). All can acquire scientific and engineering data 



Figure 4-1. Galileo spacecraft navigates between Jupiter and Galilean 
satellites in rendering. After sending a probe into the jovian 
atmosphere, the robot spacecraft will perform complex 
maneuvers at various inclinations with repeated close 
encounters with the satellites. 



Figure 4-2. Mars surface robot will operate for 2 years and travel about 
1000 km performing experiments automatically and send- 
ing the scientific information back to Earth. 



Figure 4-3. Artist's concept of a Mirs surface scientific processing and 
sample return facility. Airplanes transport samples into the 
vicinity of the processing station. Tethered small rovers 
then bring the samples to the station for appropriate 
analysis and return to Earth. 
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Table 4-1. Estimates of the technology development efforts to 
automate system functions 
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KEY: THE AUTOMATION OF THE IDENTIFIED SYSTEM FUNCTIONS REQUIRES: 


/ INTEGRATION OF EXISTING TECHNOLOGY 
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NASA/OAST SPACE SYSTEMS TECHNOLOGY MODEL, 22 MARCH 1978, * 
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using their sensors, process the data with their computers, plan 
and make decisions, and send some of the data back to Earth. 
Some robots are, in addition, able to propel themselves safely 
to different places and to use actuators, manipulators, and 
tools to acquire samples, prepare them, experiment in situ 
with them, or bring them back to Earth. 

Exploratory robots are required to send back most of the 
collected scientific data, unless they become repetitive. The 
unknown space environment accessible to the sensors is trans- 
lated into a different, still uninterpreted environment, in the 
form of computer data banks on Earth. These data banks are 
then accessible for scientific investigation long after the space 
mission is over. 

Projections into the future lead one to speculate on the 
possibility of highly autonomous exploratory robots in space. 
Such exploratory robots would communicate to Earth only 
when contacted or when a significant event occurs and requires 
immediate attention on Earth. Otherwise, they would collect 
the data, make appropriate decisions, archive them, and store 
them onboard. The robots would serve as a data bank, and 
their computers would be remotely operated by accessing and 
programming them from Earth whenever the communication 
link to the robot spacecraft is open. Scientists would be able 
to interact with the robot by remote terminal. Indeed, the 
concept of distributed computer systems, presently under 
investigation in many places, could provide to each instrument 
its own microcomputer, and scientists could communicate 
with their respective instruments. They could perform special 
data processing onboard and request the data to be communi- 
cated to them in the form desired. Alternatively, they could 
retrieve particular segments of raw data and perform the 
required manipulations in their own facilities on Earth. 

Prime elements in this link between scientists and distant 
exploratory robots would be large antenna relay stations in 
geosynchronous orbit. These stations would also provide data 
handling and archiving services, especially for inaccessible 
exploratory robots, e.g., those leaving the solar system. 

2. Global Services 

Global service robots orbit the Earth. They differ from 
exploratory robots primarily in the intended application of the 
collected data. They collect data for public service use on soil 
conditions, sea states, global crop conditions, weather, geology, 
disasters, etc. These robots generally acquire and process an 
immense amount of data. However, only a fraction of the data 
is of interest to the ultimate user. At the same time, the user 
often likes to have the information shortly after it has been 
obtained by the spacecraft. For instance, the value of weather 


information is short-lived except for possible historical reasons. 
The value of information of disasters such as forest fires is of 
comparably short duration. The demand for high-volume 
onboard data processing and pertinent automated information 
extraction is therefore great. 

The usual purpose of global service robots is to collect 
time-dependent data in the Earth’s environment, whose static 
properties are well-known. The data are used to determine 
specific patterns or classes of characteristics and translate 
these into useful information. For instance, for Landsat 
and Seasat (Figure 4-4), the data are currently sent to the 
ground, where they are processed, reduced, annotated, analy- 
zed, and distributed to the user. This process requires up to 
3 months for a fully processed satellite image and costs several 
thousand dollars. The image must then be interpreted by the 
receiver; i.e., the information must still be extracted by the 
user. 



Figure 4-4. Seasat. The oceanographic satellite's high-data-rate Synthe- 
tic Aperture Radar imaging device has provided data on 
ocean waves, coastal regions, and sea ice. 
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Present developments in artificial intelligence, machine 
intelligence, and robotics suggest that, in the future, the ground- 
based data processing and information extraction functions 
will be performed onboard the robot spacecraft. Only the 
useful information would be sent to the ground and distributed 
to the users, while most of the collected data could be dis- 
carded immediately. This would require the robot to be able 
to decide what data must be retained and how they were to 
be processed to provide the user with the desired information. 
For instance, the robot could have a large number of pattern 
classification templates stored in its memory or introduced 
by a user with a particular purpose in mind. These templates 
would represent the characteristics of objects and/or features 
of interest. The computer would compare the scanned pat- 
terns with those stored in its memory. As soon as something of 
interest appeared, it would examine it with higher resolution, 
comparing it to a progressively narrower class of templates 
until recognition had been established to a sufficient degree of 
confidence. The robot would then contact the appropriate 
ground station and report its findings and, if required, provide 
the user with an annotated printout or image. The user would 
be able to interact with the robot, indeed with his particular 
instrument, by remote terminal much the same as with a cen- 
tral computer and, depending on intermediate results, modify 
subsequent processing. 

For space exploration and global services, the ground- 
based mission operations can become extremely complex. A 
recent example of a planetary exploration mission, and perhaps 
the most complex to date, is Viking. At times there were 
several hundred people involved in science data analysis, 
mission planning, spacecraft monitoring, command sequence 
generation, data archiving, data distribution, and simulation. 
Although for earlier space missions sequencing had been deter- 
mined in advance, on Viking this was done adaptively during 
the mission. The operational system was designed so that 
major changes in the mission needed to be defined about 
16 days before the spacecraft activity. Minor changes could be 
made as late as 12 hours before sending a command. The turn- 
around time of about 16 days and the number of people 
involved contributes, of course, to sharply increased opera- 
tional costs. The Viking operations costs (Figure 3-1) are for a 
3-month mission. The planned Mars surface rover mission is 
expected to last 2 years, covering many new sites on the Mar- 
tian surface. Considering that this mission would be more 
complex and eight times as long, ground operations would have 
to be at least ten times as efficient to stay within, or close to, 
the same relative costs as for Viking. 

During the Viking mission, about 75,000 reels of image 
data tapes were collected and stored in many separate loca- 
tions. The images are now identifiable only by the time when 
and the location where they were taken. No indication regard- 


ing image information content is provided, and the user will 
have to scan catalogs of pictures to find what he or she wants. 
For such reasons, it is expected that most of the data will not 
be used again. 

The ground operations for Earth orbital missions sutler 
from problems similar to those of planetary missions. The 
overall data stream is usually much higher for Earth orbital 
missions, images are still very costly, and they take up to 
several months to reach the user. 

These considerations strongly suggest that technology 
must be developed so that most ground operation activities 
can be performed as close as possible to the sensors where the 
data is collected, namely by the robot in space. However, 
examining the various ground operations in detail, we con- 
clude that most of those that must remain on the ground could 
also be automated with advanced machine intelligence tech- 
niques. The expected benefits derived from this would be a 
cost reduction for ground operations of at least an order of 
magnitude and up to three orders of magnitude for user-ready 
image information. 

3. Utilization of Space Systems 

Space industrialization requires a broader spectrum of 
robotics and automation capabilities than those identified for 
space exploration and global services. The multitude of sys- 
tems and widely varying activities envisioned in space until 
the end of this century will require the development of space 
robot and automation technology on a broad scale. It is here 
that robot and automation technology will have its greatest 
economic impact. The systems under consideration range from 
large antennas and processing and manufacturing stations in 
Earth orbit to lunar bases, to manned space stations, to 
satellite power systems of up to 100 km 2 . These systems are 
not matched in size by anything on Earth. Their construction 
and subsequent maintenance will require technologies not yet 
in use for similar operations on Earth. 

Space processing requires a sophisticated technology. First 
it must be developed and perfected, and then it must be trans- 
ferred into the commercial arena. Basic types of processes 
currently envisioned include solidification of melts without 
convection or sedimentation, processing of molten samples 
without containers, diffusion in liquids and vapors, and electro- 
phoretic separation of biological substances. It is expected 
that specialized automated instrumentation will be developed 
for remote control once the particulars of these processes are 
worked out and the pressure of commercial requirements 
becomes noticeable. 
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Large-area systems such as large space antennas, satellite 
power systems, and space stations require large-scale and 
complex construction facilities in space (Figures 4-5 and 4-6). 
Relatively small systems, up to 100 m in extent, may be 
deployable and can be transported into orbit with one Shuttle 
load. For intermediate systems of several hundred meters in 
extent, it becomes practical to shuttle the structural elements 
into space and assemble them on site (Figure 4-7). 



Figure 4-5. Large space systems require robot and automation tech- 
nology for fabrication, assembly, and construction in 
space. 



Figure 4-6. Large space antennas are erected with the help of a 
space-based construction platform. The Shuttle brings the 
structural elements to the platform, where automatic 
manipulator modules under remote control perform the 
assembly. 


Very large systems require heavy-lift launch vehicles which 
will bring bulk material to a construction platform (Figure 
4-8), where the structural components are manufactured using 
specialized automated machines. 

The structural elements can be handled by teleoperated 
or self-actuating cranes and manipulators which bring the com- 
ponents into place and join them (Figure 4-9). Free-flying 
robots will transport the structural entities between the Shuttle 
or the fabrication site and their final destination and connect 
them. These operations require a sophisticated general-purpose 



Figure 4-7. Construction of a space station. Bulk material is brought 
by the Shuttle. Structural elements are fabricated at the 
construction facility and then assembled by remotely 
controlled manipulators. 



Figure 4-6. Complex construction facility in space with automatic 
beam builders, cranes, manipulators, etc., is served by the 
Shuttle. 
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handling capability. In addition to transporting structural 
elements, the robot must have manipulators to handle them, 
and work with them and on them. Large structural subsys- 
tems must be moved from place to place and attached to each 
other. This usually requires rendezvous, stationkeeping, and 
docking operations at several points simultaneously and with 
high precision - a problem area still not investigated for zero 
gravity. Automated “smart” tools would also be required by 
astronauts to perform specialized local tasks. 

These robot systems could be controlled remotely as 
teleoperator devices, or they could be under supervisory 
control with intermittent human operator involvement. Astro- 
nauts in space or human operators on Earth will need the tools 
to accomplish the envisioned programs. The technology for 
in-space assembly and construction will provide the founda- 
tion for the development of these space-age tools. 

After the system has been constructed, its subsequent 
operation will require service functions that should be per- 
formed by free-flying robots or by robots attached to the 
structure. The functions which such a robot should be able to 
perform include calibration, checkout, data retrieval, resupply, 
maintenance, repair, replacement of parts, cargo and crew 
transfer, and recovery of spacecraft. 



Figure 4*9. Space construction of large antenna systems with auto- 
mated tools, teleoperated manipulators, and free-flying 
robots. 


During and after construction, there should be a robot on 
standby for rescue operations. An astronaut drifting into space 
could be brought back by a free-flying robot. Such devices 
could also be on stand-by alert on the ground. The delivery 
systems for these rescue robots need not be man-rated. They 
can deliver expendable life support systems or encapsulate the 
astronaut in a life support environment for return to a shuttle, 
space station, or Earth. They could also perform first-aid 
functions. 

Another phase of space industrialization calls for a lunar 
or asteroidal base. After a surface site survey with robot (rover) 
vehicles, an automated precursor processor system could be 
placed on the Moon or the asteroid. This system would collect 
solar energy and use it in experimental, automated physical/ 
chemical processes for extracting volatiles, oxygen, metals, 
and glass from lunar soil delivered by automated rovers (Fig- 
ure 4-10). The products would be stored, slowly building up 
stockpiles in preparation for construction. The lunar or 
asteroidal base would be built using automated equipment and 
robots as in Earth orbit. After construction, general-purpose 
robot devices would be necessary for maintenance and repair 
operations. In addition, the base would use industrial automa- 
tion (qualified for operation in space) or a sort generally 
similar to those employed on Earth for similar tasks. 



Figure 4-10. Automated material processors on the lunar surface are 
serviced by robot vehicles with raw lunar soil. 
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Section V 

Technological Opportunities 


A. Trends in Technology 

Machine intelligence and robotics are not only relevant but 
essential to the entire range of future NASA activities. Content 
analysis of Earth orbital and planetary spacecraft results is 
merely one application. Other applications exist: in mission 
operations, in spacecraft crisis management, in large construc- 
tions in Earth orbit or on the Moon, and in mining in the lunar 
or asteroidal environments. These last applications are proba- 
bly at least a decade into the future, but some essential prepa- 
rations for them would seem prudent. These preparations 
might include the development of teleoperators, manipulative 
devices which are connected via a radio feedback loop with a 
human being, so that, for example, when the human on the 
Earth stretches out his hand, the mechanical hand of the 
teleoperator in Earth orbit extends likewise; or when the 
human turns his head to the left, the teleoperator’s cameras 
turn to the left so that the human controller can see the 
corresponding field of view. Where the light travel times are on 
the order of a tenth of a second or less, the teleoperator mode 
can work readily. For repetitive operations, such as girder 
construction and quality control in large space structures, 
automation and machine intelligence will play a major role in 
any efficient and cost-effective design. 

In planetary exploration lin the outer solar system, the 
light-travel times range from tens of minutes to many hours. 
As a result, it is often useless for a spacecraft in trouble to 
radio the Earth for instructions. In many cases, the instruc- 
tions will have arrived too late to avoid catastrophe. Thus, the 
Viking spacecraft during entry had to be able to monitor and 
adjust angle of attack, atmospheric drag, parachute deploy- 
ment, and retro-rocket firing. Roving vehicles on Mars, Titan, 
and the Galilean satellites of Jupiter will have to know how to 
avoid obstacles during terrain traverses and how not to fall 
down crevasses. The development of modem scientific space- 
craft necessarily involves pushing back the frontiers of ma- 
chine intelligence. 

In our opinion, machine intelligence and robotics is one of 
the few areas where spinoff justifications for NASA activities 
are valid. In most such arguments, socially useful applica- 
tions, such as cardiac pacemakers, are used to justify very 
large NASA expenditures directed toward quite different 
objectives. But it is easy to see that the direct development of 
the application, in this case the pacemaker, could have been 
accomplished at a tiny fraction of the cost of the activity 


which it is used to justify — the Apollo program, say. How- 
ever, because there is so little development in machine intelli- 
gence and robotics elsewhere in the government (or in the 
private sector), spinoff arguments for NASA involvement in 
such activities seem to have some substantial validity. In the 
long term, practical terrestrial applications might include 
undersea mineral prospecting and mining, conventional mining 
(of coal, for example), automated assembly of devices, micro- 
surgery and robotics prosthetic devices, the safe operation of 
nuclear power plants 6 or other industries which have side 
effects potentially dangerous for human health, and household 
robots. A further discussion of future NASA applications of 
machine intelligence and robotics, and possible spinoff of 
these activities, is given in the supporting documentation. 

With the development of integrated circuits, microprocessors, 
and silicon chip technology, the capabilities of computers have’ 
been growing at an astonishing rate. Figures 5-1 through 5-4 
provide estimates of recent past and projected future devel- 
opments. By such criteria as memory storage, power effi- 
ciency, size and cost, the figures of merit of computer systems 
have been doubling approximately every year. This implies a 
thousand-fold improvement in a decade. In another decade 
the processor and memory (four million words) of the IBM 

An interesting possible application of general purpose robotics tech- 
nology is provided by the nuclear accident at the Three Mile Island 
reactor facility near Harrisburg, Pennsylvania in March/April 1979. 
The buildup of a high pressure tritium bubble had as one possible 
solution the turning of a valve in a chamber under two meters of water 
impregnated with very high radiation fluxes. This is an extremely 
difficult environment for humans, but a plausible one for advanced 
multipurpose robots. The stationing of such robots as safety devices 
in nuclear power plants is one conceivable objective of the develop- 
ment ot robotics technology. Generally, such multipurpose robots 
might be stationed in all appropriate industrial facilities where signi- 
ficant hazards to employee or public health or to the facility itself 
exists. 

Shortly alter the Three Mile Island reactor accident the operating- 
company began recruiting '‘jumpers,” individuals of short stature 
willing, for comparatively high wages, to subject themselves to high 
radiation doses thought inappropriate for permanent reactor tech- 
nicians (New York Times , July 16, 1979, page 1). The functions are' 
often no more difficult than turning a bolt, but in a radiation environ- 
ment of tens of rems per hour. There would appear to be strong 
humanitarian reasons for employing small multipurpose self-propelled 
robots for this function, as well as to redesign nuclear power plants to 
make much fuller use of the capabilities of machine intelligence. The 
competent use of machine intelligence and robotics is an important 
component of all recently proposed additional energy sources — for 
example, mining and processing shale and coal. 
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Figure 5-1. Data storage technology. The storage capacity 
is doubling every 1-1/2 years, whereas the cost 
of random access memory is halving every 
2-1/2 years. In 1960, the equivalent of 
1 m^ stored a 15-page pamphlet; in 1980, the 
same space will accommodate a 2000-book 
library and in 1990, the entire Library of 
Congress. 
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Figure 5-3. Bubble memory technology. About 4 x 10® bits/cm2 
would be reached in 1985. This implies a bubble 
diameter of 10”5 C m, which is ten times greater 
than the theoretical limit. (Adapted from 
A.H. Bobeck, Bell Laboratory, ELECTRO 77, 

N. Y.). 



Figure 5-2. Active devices technology. The number of active 

components per cubic centimeter is doubling every 
1-1/8 years, whereas the average cost per logic gate 
is halving every 2-1/2 years. 
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Figure 5-4. Computer systems technology. The average increase of 
computer speed is doubling every 1-1/2 years, whereas 
the failure rate is halving every 2-3/4 years. 


370/168 will probably be houseable in a cube about five centi- 
meters on a side (although computer architecture different 
from that of the IBM 370/168 will probably be considered 
desirable). It is difficult to think of another area of recent 
technology which has undergone so many spectacular improve- 
ments in so short a period of time. 

This steep rate of change in computer technology is one 
major factor in the obsolescence of NASA computer systems. 
New systems are being developed so fast that project scientists 
and engineers, mission directors, and other NASA officials 
have difficulty discovering what the latest advances are, much 
less incorporating them into spacecraft-mission or ground- 
operations design. 

Another problem is the competition between short-term 
and long-term objectives in the light of the NASA budget 
cycle. Major funding is given for specific missions. There is a 
high premium on the success of individual missions. The safest 
course always seems to be to use a computer system which 
has already been tested successfully in some previous mission. 
But most missions have five- to ten-year lead times. The net 
result is that the same obsolete systems may be flown for a 
decade or more. This trend can be seen in areas other than 
computer technology, as, for example, in the NASA reliance 
in lunar and planetary exploration for 15 years on vidicon 
technology, well into a period when commercial manufac- 
turers were no longer producing the vidicon systems and 
NASA was relying on previously stockpiled devices. This has 
been the case since 1962. Only with the Galileo mission, in 
1984, will more advanced and photometrically accurate 
charged-coupled device systems be employed. The problem is 
much more severe when it applies to a field undergoing such 
dramatic advances as computer technology. The management 
dynamics can be understood, but it is nevertheless distressing 
to discover that an agency as dependent on high technology as 
NASA, an organization identified in the public eye with effec- 
tive use of computer technology, has been so sluggish in adopt- 
ing advances made more than a decade earlier, and even 
slower in promoting or encouraging new advances in robotics 
and machine intelligence. 

The general technological practice of adopting for long 
periods of time the first system which works at all rather than 
developing the optimal, most cost-effective system has been 
amply documented. 7 This phenomenon is by no means 
restricted to NASA. The need to handle radioactive substances 
led many years ago to the development of rudimentary tele- 
operators. At first progress was rapid, with force reflecting, 
two-fingered models appearing in the early 1950s. But this 
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development all but stopped when progress was sufficient to 
make the handling of radioactive materials possible — rather 
than easy, or economical, or completely safe. This occurred in 
part because the nuclear industry, like NASA, became 
mission-oriented at this time. Since then, the development of 7 
computer-controlled manipulators has proceeded slowly on 
relatively sparse funding, and there has been little drive to 
understand in a general and scientific way the nature of 
manipulation. Major advances seem similarly stalled and like- 
wise entirely feasible in such areas as locomotion research, 
automated assembly, self-programming, obstacle avoidance 
during planetary landfall, and the development of spacecraft 
crisis analysis systems. 

B. Relevant Technologies 

The principal activity of the Study Group during its 
existence was to identify machine intelligence and robotics 
technologies that are highly relevant to NASA and the success 
of its future programs. Each Study Group workshop had one 
or more of these topics as the foci of interest. Appendix A 
gives a complete list of topics covered at each of the 
workshops. In this section we provide a summary of the 
discussions of the topics considered by the Study Group. 

1 . Robotics Technology 

Robotics and machine intelligence have in the past played 
surprisingly small roles in NASA space programs and research 
and development. Yet these areas will become increasingly 
more important as the emphasis shifts from exploration 
missions to missions involving space utilization and industriali- 
zation and the fabrication and assembly of space structures. 
The high cost of placing people in space suggests that the use 
of robots might be the method of choice long before robotics . 
become practical on Earth. 

The uses of robotics can be broadly grouped into manipu- 
lators and intelligent planetary explorers. There already exist 
automatic vision and manipulation techniques that could be 
developed into practical systems for automatic inspection and' 
assembly of components. Parts could be marked to allow 
simple visual tracking programs to roughly position them, 
while force-sensing manipulators could mate the components." 
Where large structures, designed from standard sets of compo- 
nent parts and assembled in regular patterns are concerned, 
manipulators could perform reliable, accurate, repetitive 
operations which would be difficult for a human, in space, to 
do. Intelligent robot explorers will become imperative, if 
sophisticated large-scale interplanetary exploration is to 
become a reality. The round-trip communication delay time 


which ranges from a minimum of nine minutes to a maximum 
of forty minutes for Mars, and the limited “windows” during 
which information can be transmitted, precludes direct control 
from Earth. Thus the human role must be that of a supervisor 
and periodic program-updater of the computer. A robot Mars 
explorer must be equipped with sensors and appropriate 
computing capability which maximizes both efficient mobility 
and intelligent data gathering. 

The Study Group recommends that NASA take an active 
role in developing the necessary robotics technology, including 
rovers and manipulators , rather than expecting this technology 
to be transfered from other sources. 

2. Smart Sensor Technology 

There are several additional areas within NASA applications 
and mission programs which would benefit from advances in 
machine visual perception. These areas include remote sensing 
and crop survey, cartography and meteorology, teleoperators, 
and intelligent robot explorers. 

Current crop census systems do not seem to meet the 
expectations of lowered cost and increased repeatability from 
automated classification. It also appears that the 80-meter 
resolution per pixel of LANDSAT imagery is insufficient for 
current structural pattern recognition and scene analysis 
techniques. What is heeded is an emphasis on sensors whose 
resolution is between 2 meters and 2 centimeters per pixel. 
Coarse sampling (2 meters) would separate field boundaries, 
while finer resolution (2 centimeters) could be used to 
perform structural analysis on limited parts of the fields. 

Much work is yet to be done in computer stereo vision. 
Such systems will find applications in the automation of 
cartographic processes. While research into stereo vision have 
produced systems which work in a research environment, 
support is needed for newer high performance systems. 
Teleoperators and manipulators for fabrication and assembly 
of materials in space will require a vision system containing 
smart sensors which provide stereo presentations and the 
ability to generate multiple views. The quality of visual 
components in a teleoperator system will determine its utility 
as much as its mechanical sophistication. Intelligent robot 
explorers will rely on smart sensor visual systems in order to 
navigate and recognize interesting features to sample. Laser 
ranging devices offer minimal navigational aid due to their 
limited range capability. Stereo vision systems based on 
motion parallax offer superior capabilities by navigating with 
respect to distant landmarks. It would thus be possible to 
avoid difficult terrain and to return to locations of interest. 


The Study Group recommends that NASA expand and 
diversify its image processing research to include knowledge 
guided interpretation systems and initiate development of 
LSI-based smart sensors capable of both signal-based and 
symbolic interpretation. 

3. Mission Operations Technology 

It appears that significant cost-effective performance can 
also be realized by the application of machine intelligence 
techniques to mission planning and sequencing operations. 
These operations tend to be time-critical during space missions 
and require many repetitive and routine decision-making roles 
currently performed by human operators. Mission planning 
and control facilities dealing with data collection, experimen- 
tation scheduling, and monitoring should be automated to a 
much larger degree. Various missions may share many com- 
mon requirements which could be served by a software facility 
providing for mission-independent aspects of data collection 
and allowing embedding of mission-specific, task-oriented 
software. 

The Study Group recommends that NASA begin the 
development of a reusable , modular intelligent mission control 
center with the goal of increasing the mechanization and 
standardization of sequencing, data handling and delivery , and 
related protocols. 

4. Spacecraft Computer Technology 

Digital computers have been playing an ever increasing role 
in NASA space missions as the need to control and coordinate 
sophisticated sensors and effectors grows. They are destined to 
play a dominant role in future space missions. There are 
several issues, to which NASA should address itself, which bear 
on the ability of current space-qualified computers to support 
robotic devices requiring large central processors and memory. 

Specifically, fault tolerant designs, large scale integrated 
circuits, and computer architectures should receive attention 
by NASA. Fault tolerance implies that expected computer 
system behavior should continue after faults have occurred. 
Fault tolerance is essential to space missions since it is 
impossible to adequately test each component of the total 
system. Techniques for building reliable systems should 
include the ability to isolate the effect of a fault to a single 
module and to detect the fault so automatic recovery 
algorithms can be invoked to “repair” the fault. 

LSI technology holds the promise of more powerful, 
sophisticated computers with smaller power and weight 
requirements. However, since technology is rapidly advancing, 


the effective use of LSI systems may be severely blunted by 
the time requirements of space qualification. NASA must 
avoid committing to architectures prematurely. The adoption 
of a family of space-qualified computers would allow software 
to be developed and hardware decisions to be deferred 
allowing for more cost-effective and powerful technologies. 
There are many architectural alternatives for space computers: 
distributed, centralized, and network implementations. A 
distributed processor system is attractive from a management 
point of view since it provides separation of functions. In 
situations where there are special timing requirements for 
intelligent devices or sensors, the dedication of processors to 
these devices may be appropriate. However, in order to 
support robotic devices, much larger centralized computer 
systems, possibly with peripheral memories, will be required. 
This is an important area for study since spacecraft computer 
technoiogy will to a large part determine the sophistication 
and success of future missions. 

The Study Group recommends that NASA plan to test and 
space-qualify LSI circuits in-house to reduce the apparent 
factor of 5 or 10 increase in cost of industry supplied 
space-qualified microprocessors and memories . Further ; the 
Study Group believes that NASA should play an active role in 
encouraging the development of flexible computer architec- 
tures for use in spacecraft 

5. Computer Systems Technology 

Current trends in the use of computer technology through- 
out NASA seriously impede NASA utilization of machine 
intelligence. Distributed processing techniques being adopted 
by NASA takes advantage of microcomputer technology to 
develop intelligent sensors and controllers of instruments. 
While microprocessors are well suited for simple sensing and 
controlling functions, many of the essential functions involv- 
ing the use of machine intelligence and robotics technique 
require much larger processors. A flexible spacecraft computer 
architecture, within which both microprocessors and larger 
systems can coexist and communicate and cooperate with each 
other, seems to be a highly desirable goal for NASA. 

The standardization of computer hardware which is 
intended to reduce costs by avoiding new hardware develop- 
ment and space qualification may result in the use of obsolete 
hardware. This will limit the resources available for a machine 
intelligence system, and possible preclude any effective imple- 
mentations. NASA should look at developing techniques for 
software portability, or, equivalently, hardware compatibility 
in a family of machines. The desire to minimize software 
complexity may unnecessarily restrict experimental machine 
intelligence systems. Part of the problem rests with the issues 


of protection and reliability. NASA should reevaluate its 
hardware systems in light of recent techniques for providing 
resource sharing and protection in centralized systems. 


The Study Group recommends a “software-first” approach 
to computer systems development within NASA so that 
hardware can be supplied as late as possible in order to take 
advantage of the latest technological advances . 

6. Software Technology 

The method of software development within NASA is in 
striking contrast to program development environments that 
exists in several laboratories working on machine intelligence. 
Compared with other users of computer technoiogy, such as 
military and commercial organizations, NASA appears to be 
merely a state-of-the-art user. But compared with software 
development environments found in universities and research 
institutes there is a significant technological lag. The technol- 
ogy lag represented by this gap is not NASA’s responsibility 
alone. The gap is indicative that an effective technology 
transfer mechanism does not yet exist within the computer 
field. 

Software developed within NASA is often done in a batch 
environment using punched cards, resulting in a turnaround 
time of hours or even days. In contrast, the machine 
intelligence laboratories are characterized by being totally 
on-line and interactive. While debugging in a batch environ- 
ment is a purely manual operation, requiring modification of 
the source program via statements to display internal values 
and intermediate results, many more programming aids are 
available in an interactive laboratory environment. Changes to 
programs are automatically marked on reformatted listings, 
the author and date of the changes are recorded, and the 
correspondence between source and object modules is main- 
tained. In addition, extensive debugging and tracing facilities 
exist including interactive changing the programs data and 
restarting it from arbitrary checkpoints. The investment made 
to substitute computer processing for many manual activities 
of programmers should ultimately result in improved software 
quality and programmer productivity. 


It should be emphasized that improved software develop- 
ment facilities can be created within NASA through the 
transfer and utilization of existing computer science technol- 
ogy. However, further improvements necessitate advances in 
the field of automatic programming which is an area of 
machine intelligence where programming knowledge (i.e., 
knowledge about how programs are constructed) is embedded 


within a computer tool that utilizes this knowledge to 
automate some of the steps which would otherwise have to be 
manually performed. This is an area which deserves attention 
by NASA, perhaps towards developing specialized automatic 
programming systems tailored to NASA’s needs. 

The Study Group recommends immediate creation of an 
interactive programming environment within NASA and the 
adoption of a plan to use a modem data-encapsulation 
language (of the DOD ADA variety) as a basis of this facility. 
The Study Group also believes that NASA should initiate 
research towards the creation of automatic tools for software 
development. 

< 

7. Data Management Systems Technology 

There are several data mangement issues where artificial 
intelligence techniques could be brought to bear. These areas 
range from the control of data acquisition and transmission, 
data reduction and analysis, and methods for dissemination to 
users. For example, onboard computers should perform data 
reduction and selective data transmission. This will minimize 
the amount of data transmitted and conserve communication 
channels and bandwidth. This requires an advanced computer 
capable of various types of data analysis. Once the data 
reaches a ground collection site, there are three types of data 
management functions required to make the data accessible 
and usable to researchers. First, the data must be archived. 
This is the simplest type of management which does not 
involve analysis of the data itself. For example, “Retrieve all 
data for the fifth orbit of the Viking mission.” Secondly, 
access to specific portions or collections of the data, locating 
predetermined criteria such as “all infrared images centered 
over Pittsburgh taken between June and September of 1978” 
must be provided. Both archival and criteria selection manage- 
ment systems are well within current technology, and to some 
extent are available in systems similar to those at the EROS 
data center in Sioux Falls. However, the third type of database 
management function, the ability to access data by its content 
does not yet exist, and requires specific artificial intelligence 
support. It would utilize a knowledge base containing specific 
facts about the data, general rules concerning the relationships 
between data elements, and world models into which complex 
requests can be evaluated. This knowledge base would guide 
the system in locating data containing the desired attributes 
utilizing ’a predefined indexing criteria and the relationship of 
the desired attributes to the indexing attributes. 

The Study Group recommends reexamination and evalua- 
tion of the NASA end-to-end data management system and the 
establishment of a systems engineering group consisting of 


computer scientists and hardware experts to achieve an 
effective system design and implementation. 

8. Man-Machine Systems Technology 

For both ground- and space-based NASA systems we would 
like to have the best integration of human intelligence and 
machine intelligence; but we lack an understanding of how 
best to combine these natural and artificial components. For 
example, to be more effective in the use of teleoperators, 
NASA needs to redress a basic lack of knowledge: there now is 
no satisfactory theory of manipulation on the basis of which to 
improve design and control of manipulators. The relative assign- 
ment of roles to man and computer and the design of the related 
interfaces require much better understanding than now exists. 

In view of potential long-range payoff and the fact that 
such related research as exists within NASA has been ad hoc 
and mission-oriented, the Study Group recommends support 
of significantly more basic research on man-computer coopera- 
tion , and , more generally , on man-machine communication 
and control NASA organizational entities representing life 
sciences and the technological disciplines of computers and 
control should develop better cooperative mechanisms and 
more coherent programs to avoid man-machine research 
“falling between the cracks,” as has been the case. Future 
NASA missions can have the advantages of human intelligence 
in space, without the risks and life support costs for 
astronauts, by developing teleoperators with machine intelli- 
gence, with human operators on Earth monitoring sensed 
information and controlling the lower-level robotic intelligence 
in supervisory fashion. 

9. Digital Communication Technology 

Computer based communication systems have been used by 
the artificial intelligence community since the inception of the 
ARPANET network which is now used under NSF support to 
link approximately 500 non-computer scientists in about eight 
different research communities. These systems provide elec- 
tronic mail (using distribution lists) and communication, and 
are used to give notices and reminders of meetings and reports. 
Online documentation of programs with instant availability to 
updated versions allow users access to information and 
programs at a variety of research sites. In addition, document 
preparation services including text editing systems, spelling 
correctors, and formatting programs are in common use. 
NASA would do well to adopt a computer based communica- 
tion system since it would offer opportunities for improve- 
ments in management, planning, and mission implementation. 
If the system were a copy of existing systems at research sites 
on the ARPANET, software could be taken directly from 
those systems. 
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Appendix on Relevant Technologies 


The principal activity of the Study Group during its 
existence was to identify information processing technologies 
that are highly relevant to NASA and to the success of its 
future programs. Each workshop had one or more of these 
topics as the foci of interest. Appendix A gives a complete list 
of topics covered at each of the workshops. In this section we 
provide detailed discussions of those topics which are consid- 
ered by the Study Group to be of high priority for NASA. 

1. Robotics Technology 

This section discusses the need for advanced development 
of intelligent manipulators and sensors. The application areas 
for these devices range from the assembly of space structures 
to planetary rovers capable of autonomous execution of highly 
sophisticated operations. Research in the areas of robotics and 
artificial intelligence is necessary to ensure that future missions 
will be both cost-effective and scientifically valuable. In 
addition, results in robotics and artificial intelligence are 
directly applicable in the areas of automatic assembly, mining, 
and exploration and material handling in hazardous 
environments. 

1.1 Need for Robotics Within NASA 

Robotics and artificial intelligence have played surprisingly 
small roles in the space program. This is unfortunate because 
there are a number of important functions they could serve. 
These include, very broadly: 

1. To enable missions that would otherwise be out of the 
question because of cost, safety, or feasibility for other 
reasons. Example: At rather low cost, we could have had 
a remotely-manned lunar explorer in progress for the 
past decade. 

2. To enable the kinds of popular and valuable features 
that might rekindle public interest in the exploitation 
and exploration of space. Example: In the past decade, 
the hypothetical lunar explorer just mentioned would 
have been operating for 1,000,000 five -minute intervals. 
In this period, a vast number of influential public visitors 
could have operated some of the Explorer’s controls, 
remotely, from NASA visitor centers. Imagine the 


education and enthusiasm that could come from suclTa 
direct public participation in space! 

3. To achieve general cost reductions from efficient auto- 
mation. Example: The Skylab Rescue Mission would 
have been a routine exercise, if a space-qualified tele- 
operator had been developed in the past decade. It 
would have been a comparatively routine mission to 
launch it on a military rocket if the Shuttle project 
encountered delays. 

These things have not been done, in part, because NASA 
has little strength at present in the necessary technical areas. In 
our view the future prospects seem poor unless there is a 
change. We see several obstacles: 

In-House Competence. NASA’s current strength in artificial 
intelligence is particularly low. NASA’s in-house resources are 
comparatively weak, as well, in computer science on the whole", 
especially in areas such as higher-level languages and modem 
debugging and multiprocessing methods. 

Self-Assessment. Even more serious, NASA administrators 
seem to believe that the agency is outstanding in computation 
science and engineering. This is far from true. The unawareness 
of weakness seems due to poor contact of the agency’s 
consultants and advisors with the rest of the computational 
research world. 

Superconservative Tradition. NASA has become com- 
mitted to adhere to the concept of very conservative, fail-safe 
systems. This is eminently sound in the days of Apollo, when 
(i) each successful launch was a miracle of advanced tech- 
nology and (ii) the lives of human passengers were at stake. 
But today, we feel, that strategy has become self-defeating, 
leading to unnecessarily expensive and unambitious projects.- 

Fear of Complexity. On a similar note, we perceive a broad 
distrust of complicated automatic machinery in mission 
planning and design. This distrust was based on wise decisions 
made in the early days of manned space exploration, but it is 
no longer appropriate in thinking about modem computation. 
Instead of avoiding sophisticated computation, NASA should 
become masterful at managing and exploiting it. Large 
computers are fundamentally just as reliable as small 
computers. 
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Fear of Failure. Many NASA people have confided to the 
Study Group that the agency is afraid that any mission failures 
at all may jeopardize the whole space program, so that they 
“cannot take chances” in advanced design. Again, this attitude 
was sound in the Apollo era, but probably is not sound when 
we consider the smaller, multiple, and individually inexpensive 
missions of today. 

What Are the Alternatives? We feel that NASA should begin 
to consider new styles of missions which are, at the same time, 
more adventurous and less expensive. Left as it is, NASA’s 
thinking will continue to evolve in ways that will become 
suffocatingly pedestrian. To get out of this situation, it will be 
necessary to spend money, but the amount needed to learn to 
do exciting things like using powerful computers and semi- 
intelligent robots will be small compared to the money needed 
in the past for developing propulsion systems. “Getting there” 
is no longer all the fun; it is time to think about how to do 
sophisticated things after the mission arrives there. 

Space Programs and Intelligent Systems. It is extremely 
expensive to support personnel in space for long periods. Such 
costs will render impossible many otherwise exciting uses of 
space technology. Yet, our Study Group found relatively little 
serious consideration of using autonomous or semi- 
autonomous robots to do things in space that might otherwise 
involve large numbers of people. In many cases, the use of 
artificial intelligence had not been considered at all, or not 
considered in reaching conclusions about what computer 
resources will be needed, or prematurely dismissed on the basis 
of conversations with the wrong people. In other cases, it was 
recognized that such things were possible in principle, but out 
of the question because of NASA’s mission-oriented — as 
opposed to technology-oriented — way of planning for the 
future. 

Two examples come to mind as obvious illustrations of 
cases where we found the views expressed to be particularly 
myopic: 

(1) Building Large Space Structures. Large-scale construc- 
tions usually involves two activities. First, basic 
building blocks must be fabricated from stock material. 
Second, the building blocks must be assembled. Space 
fabrication seems necessary becuase of difficulty in 
launching large prefabricated sections. We applaud the 
work that NASA has done already toward creating 
machines that continuously convert sheet metal into 
beams. We are less happy with the lack of justification 
for automatic inspection and assembly of such beams. 
There are existing automatic vision and manipulation 
techniques that could be developed into practical 
systems for these tasks. The beams could be marked. 


during fabrication, so that descendants of today’s visual 
tracking programs could do rough positioning. And, 
force-sensing manipulators could mate things together, 
once roughly positioned. Where large structures are 
concerned, in fact, these are areas in which reliable, 
accurate, repetitive human performances would be very 
hard to maintain. 

(2) Mining. An ability to build structures is probably a 
prerequisite to doing useful, economically justified 
mining on the Moon, the planets, and the asteroids. But 
the ability to build is only a beginning. The vision and 
manipulation problems that plague the robot miner or 
assembler are different. Rocks do not have fiduciary 
marks, and forces encountered in digging and shoring 
are less constrained than those involved in screwing two 
parts together. On the other hand, less precision is 
required, and even interplanetary distances do not 
prevent the exchange of occasional questions and 
return suggestions with Earth-based supervisors. 


1.2 The State of the Art 

At this point, we turn to some specific areas, both to draw 
attention to NASA’s special needs and to tie those needs to 
the state of the art. 

Basic Computer Needs. A first step toward enabling the use 
of artificial intelligence and other advanced technologies is to 
use more sophisticated computer systems. We conjecture that 
the various benefits that would follow from this approach 
could reduce the cost of spacecraft and ground-based opera- 
tions enough to make several missions possible for the present 
cost of one. 

We want to emphasize this point strongly, for we note a 
trend within NASA to do just the opposite! In our Study 
Group meetings with NASA projects over the year, time and 
time again we were shown “distributed” systems designed to 
avoid concentrating the bulk of a mission’s complexity within 
one computer system. However, we feel that this is just the 
wrong direction for NASA to take today because computer 
scientists have learned much about how to design large 
computer systems whose parts do not interact in uncontrol- 
lably unpredictable ways. For example, in a good, modern 
“time-sharing system” the programs of one user - however 
badly full of bugs - do not interfere either with the programs 
of other users or with the operation of the overall “system 
program.” Thus, because we have learned how to prevent the 
effects of bugs from propagating from one part to another, 
there is no longer any basic reason to prefer the decentralized. 
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“distributed” systems that became the tradition in the 
“fail-safe” era of engineering. 

However, because NASA has not absorbed these tech- 
niques, it still distrusts centralization of computation. We 
argue elsewhere that this leads to very large and unnecessary 
costs of many different kinds. 

The Development of Sophisticated Manipulators. We feel 
that NASA has not adequately exploited the possibilities of 
even simple man-controlled remote manipulators. The Skylab 
sunshade episode might well have been easily handled by an 
onboard device of this sort, and we think it likely that it 
would have paid for itself in payload by replacing some 
variety of other special-purpose actuators. 

The need to handle radioactive substances led to the 
development of rudimentary teleoperators many years ago. At 
first progress was rapid, with force-reflecting, two-fingered 
models appearing in early 1950s. But, strangely, this develop- 
ment all but stopped when progress was sufficient to make the 
handling of nuclear materials possible, rather than easy, 
economical, and completely safe. We believe that this hap- 
[xmed because the nuclear industry, like NASA, became at this 
time mission-oriented rather than technology oriented - so 
that places like Argonne National Laboratory lost their basic 
research and long- view funding. 

Consequently, today manipulators differ little from their 
1950s ancestors. They are still two-fingered and they still leave 
their operators fatigued after a half-hour or so of use. Even 
today, there is no generally available and reliable mobile and 
dexterous manipulator suitable for either emergency or pre- 
ventive maintenance of nuclear plants - this is still done by 
people working under extremely hazardous conditions. Con- 
cerns within a nuclear plant about storage safety, detection of 
faults, and adequacy of emergency systems are perhaps best 
handled using a mobile and dexterous robot. 

If such devices had been developed - and space-qualified 
versions produced - NASA could have exploited them, both 
for teleoperator (human-controlled) and for fully autonomous 
(robot) use. Indeed, we feel, NASA’s needs in this area are 
quite as critical as those in the nuclear industry. Nevertheless, 
NASA has not given enough attention to work in the area.’ 
Perhaps a dozen or more clumsy two-fingered systems have 
been developed, but all of these would be museum pieces had 
the work gone at proper speed. 

It therefore makes sense for NASA to enter into a 
partnership with ERDA to reverse the neglect of manipulator 
technology. A good start would be to sponsor the develop- 
ment of a tendon-operated arm with a multifingered hand, 


both heavily instrumented with imaginative force, touch 
sensors, and proximity vision systems. Besides the obvious 
value - in space - of separating the man and his life-support 
problems from the workspace, there are many obvious spinoffs 
in general manufacturing, mining, undersea exploitation, 
medicine (micro-teleoperators), and so forth. 

Controlling a Manipulator: Still a Research Problem. 
Dynamic control of the trajectory of a many-jointed manipu- 
lator seems to require large calculations, if the motion is to be 
done at any speed. It takes six joints to put a hand at an 
arbitrary place at an arbitrary orientation, and the six degrees 
of freedom have interactions that complicate the dynamics of 
arm control. The equations are too complex for straight- 
forward real-time control with a low-capacity computer. The 
problem can be simplified by placing constraints on manipu- 
lator design, for example by designing the axes of rotation of 
the last three joints to intersect, but even the simplified 
problem is not yet solved. 

In any case, the most obvious approach — to put an 

independent feedback control loop around each joint fails 

because constant feedback loop gains cannot manage (at high 
speeds) the configuration-dependent inertia terms or the 
velocity interaction terms. On the other hand, it seems clear 
that such problems can be solved by combinations of “table 
look-up” for sample situations with correctional computa- 
tions. In any case the control computer will need a central 
memory that is large by today’s space standards. 

Rover Mobility, Locomotion, and Guidance Research. 
Although much knowledge regarding several of the solar 
system planets has been gained through missions employing 
remote sensors, and more can be obtained in the future in this 
manner, many of the critical scientific questions require 
detailed surface experiments and measurements such as those 
conducted by the Viking landers on Mars. Despite the historic 
achievement represented by the soft landing of the Vikings 
and the effectiveness of the onboard experimental systems, 
more new important questions were raised. For these to be 
answered, an extensive surface exploration should be under- 
taken. A surface trajectory involving hundreds of kilometers, 
and desirably over 1000 kilometers, would be required to 
explore a sufficient number of the science sites on Mars to gain 
an adequate coverage of the planet. 

The round-trip communications delay time, which ranges" 
from a minimum of nine minutes to a maximum of forty 
minutes for Mars, and the limited “windows” during which 
information can be transmitted precludes direct control of the 
rover from Earth. Accordingly, a rover on Mars or another 
planet must be equipped with sensors and appropriate com- 
puting capability and procedures to proceed autonomously 
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along Earth-specified trajectories. The intelligence of this path 
selection system, together with the basic mobility character- 
istics of a rover, determine whether scientific sites of specific 
interest can be reached, given the characteristics of the 
approach terrain and the distances between sites. It follows 
that a low-mobility rover equipped with a high-quality path 
selection system will not be able to reach particular sites nor 
could it undertake an extensive mission. It also follows that a 
high-mobility rover guided by a low-quality path selection 
system would be limited in a similar fashion. Therefore, 
systematic research programs aimed at maximizing both the 
rover mobility and the intelligence in path selection systems 
consistently should be undertaken to provide a sound basis for 
the planning and execution of surface exploration of solar 
system bodies. - 

The term “mobility” includes several characteristics which 
when taken together describe collectively the capability of the 
rover to deal with specific classes of terrain. 

1. The stability of the rover in terms of the in-path and 
cross-path (i.e., pitch and roll) which the rover can 
handle without the hazard of overturning. This charac- 
teristic is not only important in terms of the general 
slope characteristic of the terrain surface, but especially 
in connection with boulders and trenches on which 
individual propulsion elements may find temporary 
purchase (foothold). 

2. The maneuverability of the rover, i.e., the turning radius 
and dynamical characteristics, will determine what ter- 
rains in the large sense will be open for exploration. 
Unless the rover is able to execute strong turning 
trajectories and maneuver in close quarters, many areas 
will be forbidden. 

3. Clearance of the payload above the propulsion units will 
have a direct effect on the available paths. A rover whose 
clearance is adjustable will not only offer prospects for 
recovery should the rover become hung-up but may also 
offer additional scientific capabilities. Finally , an adjust- 
able clearance would allow for the rover s center of 
gravity to be reduced temporarily in situations where the 
critical pitch/roll conditions are approached to increase 
safety or to permit the rover to follow a normally unsafe 
terrain. 

4 _ The rover’s speed capabilities will have a direct effect on 
the scope of the time required for the traverse between 
specified science sites. 

5. Locomotion is a very major factor since it exerts a 
primary limit as to what terrains can be handled. The 


three major alternatives available, wheels, tracks and 
legs, not only offer varied propulsive and maneuver- 
ability capabilities as well as potential sensor informa- 
tion for guidance, but also pose unique as well as general 
control problems. 

With respect to the propulsive and maneuverability factors, 
wheels and tracked units can be designed to achieve the 
required footprint pressures and traction required to deal with 
soft, loose materials such as ultrafine sand as well as hard 
coherent terrain forms such as boulders. Wheels have the 
advantage of being able to change direction with a minimum 
of scuffing and to tolerate small obstacles in lateral motion. 
The tracked units have the advantage of being able to bridge 
larger trenches but offer potential problems in turning on 
irregular terrain. 

Neither the potential nor the limitations of such concepts 
have been firmly established and a systematic research and 
development program would appear to be in order. Such a 
program should be aimed at developing maximum carry 
load-to-wheel weight ratios consistent with reliability, 
footprint pressure, turning capabilities, and dimension. 

A legged vehicle, which makes use of six or eight legs of 
varying joint complexity, would appear to offer decided 
advantages over wheeled or tracked vehicles in extremely 
rugged and irregular terrain. Depending on the number of 
segments and their lengths and the degrees of freedom 
provided by the connecting joints, a rover capable of dealing 
with extraordinarily irregular terrain and possessing excep- 
tional climbing ability is potentially feasible. Maneuverability 
and stability potential of such a rover could exceed that of 
wheeled or tracked rovers. However, the feet of such a device 
may raise a serious problem. Rather large feet would be 
required to provide a sufficiently low footprint pressure on 
soft or loose terrain. On the other hand, such big broad feet 
might seriously limit the rover’s capability in gaining a firm 
purchase on very irregular terrain. Research on legged vehicles 
has been very limited in the United States. At the present 
time, McGee at Ohio State University has an active hardware 
program. Considerable efforts are apparently underway in the 
Soviet Union but virtually nothing is known of the details of 
this work, other than that they are proceeding vigorously. 
Successful development of a legged vehicle would apply to 
environmentally delicate regions such as tundra as well as 
space exploration. 

The control of either wheeled, tracked, or legged rovers 
represent problems of substance which will require study. The 
wheeled or tracked vehicle control system will have to respond 
to constraints imposed by irregular terrains. In the case of 
locomotion on a flat plane, it is a straightforward matter to 


specify a vehicle speed and a steering angle to a computer- 
driven or hard-wired control system to drive each wheel at the 
proper speed to achieve the desired motion without scuffing 
and without excessive stresses either on the propulsion system 
or the vehicle structure. However, if the vehicle is on irregular 
terrain so that the axle velocity vectors are no longer coplanar, 
then each wheel must be driven at a specific rate to achieve the 
desired result. Wheel speed and torque as well as the vehicle 
strut position locations, possibly force or stress sensors, and 
the pitch/roll of the rover will have to be combined with 
trajectory parameters to achieve an acceptable system. 

The legged-vehicle control problem is of a different 
character. Certainly all the dynamic control problems dis- 
cussed above in connection with manipulation reappear. 
Moreover, additional problems come up. The gaits (sequences 
in which the legs are moved) -which are selected are a function 
of the terrain to be traversed and the desired speed. The 
specific motion of an individual leg may also be a function of 
the terrain. In the case of irregular terrain, a significant lift of 
the leg to avoid hazards will be required before the foot can be 
lowered to the desired position. Sensors and control systems 
controlling the motion will have to be developed. 

In order for the rover to autonomously execute highly 
sophisticated operations in an unpredictable environment, it 
must be capable of real-time interaction with sensory feed- 
back. It must be capable of selecting and modifying its 
behavior sequences in response to many different types of 
sensory information over a wide range of response times. For 
example, the mobility system should respond almost instan- 
taneously to pitch and roll accelerations, but may tolerate 
longer time delays as it picks its way around small rocks and 
ruts on a meter by meter basis. It should anticipate larger 
obstacles two to five meters ahead and impassable barriers 
should be detected 5 to 100 meters in advance. Minimum 
energy pathways along contour lines, through valleys, and 
between hills should be selected 0.1 to 1 km ahead, and long 
range navigational goals should be projected many kilometers 
ahead. 

Similarly with manipulation, position servo corrections 
must be applied with very short time delays, whereas feedback 
from proximity sensors can be sampled only a few times per 
second in order to modify approach path motions which move 
slowly over a distance of a few centimeters. Processing of 
feedback in order to select alternative trajectory segments 
during the execution of elemental movements is more com- 
plex, and can be done at still coarser time intervals. The 
modification of plans for simple tasks to accommodate 
irregularities in the environment, the modification of complex 
task plans, or changes in scenarios for site exploration require 
increasingly complex sensor analysis processes which can be 


safely carried out over longer time intervals. The most natural 
way to deal with this hierarchy of ascending complexity and 
increasing time intervals is to map it onto a computing 
mechanism with the same hierarchical structure. 

The important issue in the mobility control hierarchy is the 
wide range of time and distance scales to which the sensory 
data must interact with the mobility system. Some types of 
feedback must be incorporated into the control system with 
millisecond and centimeter resolution, while other feedback 
can be incorporated at intervals of days or kilometers. Only if 
the control system is hierarchically structured can such a wide 
range of resolution requirements be easily accommodated. 

Automatic Assembly and Force Feedback. The most naive 
concept of automation is to make a robot that will repeat 
pre-programmed motions over and over. This will not work in 
many situations; using position control alone, a robot cannot 
insert a fastener in a tight hole or even turn a crank — because 
the inevitable small errors would cause binding or breakage. 
Consequently, it is necessary for robot manipulators to use 
force-sensing feedback or the equivalent. 

In the 1960s, experimental systems demonstrated such 
methods for automatic assembly. Centers in Japan, the U.S.-, 
and the U.K. succeeded nearly simultaneously. In one such 
demonstration, Inoue, working at MIT, used an arm equipped 
with a force-sensing wrist designed by Minsky to assemble a ' 
radial bearing. Shortly thereafter researchers at the Draper 
Laboratory exhibited a device to do some kind of work with 
carefully arranged passive, compliant members replacing active 
force-sensing feedback loops. Using such techniques, we think 
that much of the automatic assembly of space structures 
already comes near to the state of this art. 

Automatic Assembly and Problem Solving Systems. 
Problem solving and languages for problem solving has been a 
central focus in artificial intelligence since the science began. 

In the earliest stages of AI, it was seen that a computer could 
be programmed to try a variety of alternatives, when it 
encountered a situation not specifically anticipated by the 
programmer. Soon these “heuristic search” programs were, 
succeeded by “goal-directed” problem-solvers, notably the 
GPS system of Newell and Simon at Camegie-RAND. The 
symbolic integration program by Slagle is perhaps the best 
known example from that era. 

Since that time, there has been a steady stream of new 
ideas, both for more general theories and for the design of 
problem solvers for particular problem domains. This work led 
to a variety of new computational organizations and languages 
LISP, PLANNER, CONNIVER, STRIPS, QA4, and Production 
Systems are representative of this conceptual evolution. The 


MYCIN program for bacteriological diagnosis and treatment, 
and the PARSIVAL program for analyzing English syntax are 
representative of what can be . done to attack small, 
well-defined domains. 


problem and even though everyone in the field has thought 
about the problem from time to time. 

1.3 Recommendations 


In the last few years, some steps have been taken to apply 
the resulting technology to the problem of assembly automa- 
tion. It would be impractical and unreliable to program 
assembly machines at a low level, giving step-by-step instruc- 
tions about exactly where to move the hand and exactly when 
to close it around an object. It is better - and easier in 
principle - to design languages with embedded problem- 
solving apparatus, so that the “programmer” can give instruc- 
tions in much the same way as one communicates with people. 
First, one states the general goal, names the parts, and suggests 
an order in which the parts should be put together. Later, one 
makes suggestions as they come to mind, or if and when the 
assembly machine gets stuck. 


Several research centers are now working on such problems, 
among them Stanford, SRI, IBM, NBS, and MIT A full 
solution is some distance off, but the work has the fortunate 
character that each step in basic progress yields a corre- 
sponding step in application. In early stages, the amount o 
suggestion and detaU supplied by the programmer is large, but 
the amount decreases as the problem solver gets smarter an 
knows more. 


We must re-emphasize two major obstacles to addressing 
the needs just outlined. The first is that of a fail-safe attitude. 
NASA pioneered in achieving extraordinary reliability in its 
fail-safe, redundant designs for missions. We have the impres- 
sion that the use of these techniques is persisting in new 
problems to the point of some dogmatism, overlooking new 
possibilities enabled by progress in computer technology. In 
particular, we believe a great increase in flexibility and 
reliability might be obtained through centralizing many 
operations within one computer. But we see an opposite 
tendency; to design multiple, “distributed” computer systems 
that limit the flexibility of the system. On the surface, this 
seems sensible; but we believe that it leads to overlooking 
other, more centralized ways to do things that may be 
cheaper, more versatile, and at least equally reliable. For 
example, one might imagine missions that depend utterly on 
one central computer and one manipulator to replace many 
special systems. Of course, one of these two components 
might fail and lose the mission. On the other hand, eventually 
such a system might be (1) an order of magnitude cheaper and 
(2) possibly more reliable - because of extensive concentra- 
tion on the two components and because of their ability to 
salvage or repair other failing components. 


Automatic Assembly and Vision. We are still far away from 
being able to make a computer “see” - to describe and 
recognize objects and scenes as well as a person can. In spite of 
much brillant work done in this field, “general-purpose 
computer vision” is still far away. Still, the special and 
controllable environments involved in manufacturing have 
enabled exciting demonstrations with near-term promise. One 
of these, done by Rosen and his colleagues at SRI, uses binary 
image processing techniques to identify parts and their 
orientation after they have been placed randomly on a light 
table. In other work, done under the direction of Horn at MIT, 
inspection programs have successfully examined watches to 
make sure the hands are moving, castings to make sure the 
grain structure is correct, and IC lead frames to make sure the 
pins are straight. We believe that this sort of work has great 
promise of enabling work in space that might otherwise never 
be done. Still, we emphasize that of all problems described 
here, computer vision is likely to prove the most difficult and 
most deserving of attention and funding. The successful 
examples cited are included only to suggest that there is a 
technology to be explored for potential uses within NASA, 
not that there is a technology that can be merely bought. 
There is, for example, no general systems for selecting parts 
from a bin, even though it is well-known to be a serious 


NASA’s second major problem is that its current strength is 
low in artificial intelligence and even in general computer 
science. There are few people within NASA who understand 
the state-of-the-art. There is no place where those who do 
artificial intelligence work can reach critical mass with respect 
to the number of high-quality researchers or with respect to 
computational and other supporting resources. This has led to 
three regrettable consequences. First, present NASA workers 
are unable to be maximally productive. Second, it is extremely 
difficult to attract talented people to NASA. And third, those 
people in NASA that most need advice on artificial intelligence 
do not find it. Instead, they incorrectly suppose that they 
must be in good hands because NASA spends a great deal of 
money on computation. 

This has led to a great gap. Much of what NASA does with 
computers is years outof-date. Worse, with only a few 
exceptions, influential people in NASA do not realize how 
out-of-date most of their thinking has become. In such areas as 
computer languages, the situation is nearly scandalous. Part of 
the problem has to do with mission-oriented horizons, and 
part with distrust of outside researchers. Because typical 
“Earth-bound” workers do not have such concern with 
reliability and simplicity, we conjecture that NASA mission 
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workers feel that the techniques of non-NASA people are 
inapplicable. Instead of working with AI and robotics projects 
outside, NASA has tended to try to build its own. But these 
projects have never reached critical mass and have not 
attracted enough first-rate workers. The problem is connected, 
again, with the lack of modem computing power; modem 
vision and robotic control concepts require large computer 
programs and memories. We believe that there is no reason 
such systems cannot be space-qualified, and that they need not 
be very heavy or power-hungry. But without them, it is hard 
to use modem ideas about control and operations. 

How to Correct the Central Problem of Insufficient 
Expertise. One idea is to contract with computer companies to 
provide advice and needed research. This idea, however, will 
not work. The large companies NASA is comfortable working 
with have not yet developed strength in artificial intelligence. 
NASA can only be led into a false sense of security by relying 
on them. Alternatively, NASA could increment its small 
existing budget for artificial intelligence and related topics, 
increasing the funds available at existing places. This also will 
not achieve the desired results. Indeed, such a plan could be 
counterproductive. The nature of the work demands a 
community of highly-motivated people working together. 
Efforts below critical mass in human or other resources are not 
likely to do well and such efforts could therefore lead to 
pessimism rather than excitement. 

Still another possibility is that NASA could fund university 
research. This is a reasonable alternative as long as it is a gain 
understood that small, subcritical efforts are not cost-effective. 
Only a half-dozen university centers have sufficient existing 
size and strength to do really well. And finally, NASA could 
establish its own center. This is a good choice, especially if 
done in close proximity to and collaboration with an existing 
university center. It is our opinion that the need for artificial 
intelligence in space argues for such a center in the strongest 
terms. We believe that artificial intelligence will eventually 
prove as important to space exploitation and exploration as 
any of the other technologies for which there are large, 
focused, and dedicated NASA centers today. 

Future NASA Role. At a certain level of abstraction, 
NASA s needs are not unique. Certainly such things as 
automated assembly and mining would be useful on Earth as 
well as in space. But it would be folly for NASA to expect 
someone else to produce the needed technology. NASA should 
plan to be the donator of artificial intelligence robotic 
developments rather than the benefactor for several reasons. 

First, not enough is happening for reasons ranging from the 
shape of our antitrust laws to the lack of congressional 


concern for our declining position in productivity. Second, the 
extreme cost of placing people in space ensures that using 
robots and/or teleoperators will be the method of choice in 
space assembly and mining long before robots see much action 
on Earth. Consequently, cost/benefit ratios will be more of a' 
driving force to NASA than to others. And third, doing things Z 
in space is sufficiently special that NASA must be in the act in 
a major way to ensure that the technology progresses with 
NASA’s interests in mind. Otherwise, all NASA will have is a 
technology that is solving someone else’s problems but skirting 
NASA’s. 


The Virtual Mission Concept - A Special Recommenda- 
tion. The establishment of research efforts, well endowed with 
human and financial resources, should be accompanied by a 
new kind of attitude about mission planning and development, 
particularly with respect to space qualification of hardware. As 
it stands today, work seems to be done in two primary 
contexts, that of the paper study and that of the approved, 
assumed-to-fly mission. This automatically ensures two 
crippling results. 

First, since the execution of a mission is very expensive, 
only a small number of the promising ideas will go forward to 
the point of full and fair evaluation and to the point of 
generating spinoff technology. Second, since space qualifica- 
tion is an assumed starting point for all thinking, the 
technology employed in mission development is guaranteed to - 
be years behind the state of the art. The chances for pushing 
the state of the art via spin-offs is smaller than it should be. 
Paper studies, on the other hand, tend to produce mostly 
paper. 

Consequently we see the need for a new kind of research 
context, that of the virtual mission. Such missions, would have 
the same sort of shape as real missions, with two key 
exceptions: first, space hardened and qualified hardware 
would not be used; second, the objective would not be to fly, 
but rather to win an eligibility contest. As we see it, there 
would be many virtual missions competing for to-be-flown 
status. Taken together, they would produce a pool of 
alternatives, any of which could be selected and flown, with 
space qualification taking place after, rather than before" 
selection. Since none would be fettered by the limits of space 
qualification for their entire life, all would be more imagina- 
tive, technically exciting, and technically productive by way of "' 
spinoff technology. We believe that the costs involved in doing 
things this way are likely to be reduced. Conceivably, several 
virtual missions could be done, using commercial equipment 
where possible, for the price of one, whereas real missions are 
restricted as now to old fashioned, one-of-an-obsolescent-kind 
antiques. 
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There could be, for example, several groups competing by 
applying different locomotion schemes to the same explora- 
tion job. Similarly, several groups could explore a variety of 
shuttle-based, large-structure assembly ideas. 

We should begin thinking along these lines because 
spacecraft computer hardware is becoming more and more 
out-of-date and something simply must be done about it. 
Today it takes too long to “qualify” space computers. There 
seems to be no mission-independent way to do this. Individual 
missions have to use computers qualified by previous, almost 
accidental, qualification incidents. Memory sizes, in particular, 
are much too small. This leads to weak programs with minimal 
versatility and to doing things in hardware that might be 
lighter and more reliable in software. Therefore, at the very 
least, NASA should have a continuing program to space- 
qualify larger memories and more capable computers, as they 
evolve. We know enough about computation, today, to be able 
to assert that there is little reason to suppose that the 
computers will have to be adapted to particular missions 
much, except in regard to overall capacity parameters. 

Recommendations for Rover Research. Given the need to 
take advantage of imminent opportunities on Mars, we believe 
the design, construction, and systematic evaluation of a 
functional reconfigurable rover should be undertaken to: 

1 . Determine optimal configuration alternatives from the 
standpoint of stability, maneuverability, and clearance 
with weight as a major, if not the major, consideration. 

2. Evaluate alternative wheel/trackingAeg concepts as a 
function of terrain classes with respect to speed, 
steering, obstacle climbing ability, weight, and reliability 
both in the laboratory and out in the field. 

3. Serve as a test bed for the development and evaluation 
of alternative vision/sensor/calculational/guidance con- 
trol systems applicable separately to long-range, 
mid-range, and short-range path-planning levels and to 
integrated systems ultimately. There is a corollary need 
to develop additional sensors to provide real-time sen- 
sory feedback with a much broader range of spatial and 
temporal resolution. 

2. Smart Sensor Technology 

This section comments on NASA programs which use vision 
science to make an impact in its applications and mission 
programs. It summarizes the state of the art in the required 
technology areas, summarizes research recommendations, and 
suggests a structure which will encourage required research. 


NASA conducts large imaging programs which produce 
enormous volumes of images. NASA programs are studying 
means of making image data more available and more useful 
for users (NASA End-to-End Data Management System 
program). Those activities are largely for presentation of 
images to humans for human perception. Those NASA 
projects with large potential benefit to society which involve 
machine visual perception include: 

1. Construction of large space structures, particularly 
communications systems and antennas. 

2. Remote sensing and agricultural resource evaluation. 

3. Cartography. 

4. Meteorology. 

Advances in computer vision would enable increased 
effectiveness of the proposed Mars rover mission. These 
applications require a compact area of vision science and 
technology. NASA’s vision applications are sophisticated. 
Suggestions are made to advance NASA objectives by. 

1. Collaboration with other organizations which have an 
investment in applications requiring similar technology 
and which support research in this area of image science. 

2. Involving the most advanced research groups in research 
program formulation and implementation. 

3. Evaluation of current NASA imaging programs. 


2.1 Introduction 

Automated imaging and mapping systems are planned to 
meet objectives of NASA application programs and missions. 
Earth resources surveys include crop production, water 
resources, land use, forest resources, ocean resources, and oil 
spill monitoring. Meteorological prediction, monitoring, and 
climatic studies already make an impact in daily life. Geolog- 
ical studies include crustal dynamics and a world geological 
atlas. Large space structure construction is likely to be 
important for communications. Automated sensing has a role 
in these applications. 

These activities overlap responsibilities of other organiza- 
tions such as USGS, Forest Service, Defense Mapping Agency, 
etc. Capabilities necessary for NASA functions enable NASA 
to contribute significantly to development of automated 
imaging for civilian purposes. A major part of these NASA 
programs requires innovative and high level research to develop 
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required technology. NASA can lead in development of this 
technology. Organizations with similar responsibilities have 
little resources to sponsor and direct research. A major 
emphasis of this section is that it is important for NASA to get 
‘leverage” in research and applications, that is, to work with 
existing research programs and to work with potential users of 
the technology. 

NASA performs two functions in this area, data distribu- 
tion and information extraction. Most effort has gone into 
data distribution. Much work remains. Current and planned 
imaging missions provide volumes of data beyond existing 
abilities to catalog, distribute, and assess the images. Smart 
sensors for data compression, automated image handling 
facilities, and high performance computers for imaging are 
needed. These needs are recognized by the NEEDS (NASA 
End-to-End Data Systems) program which addresses smart 
sensors, special purpose imaging computers, and image 
handling facilities. 

A greater need exists in information extraction. Here, 
NASA’s objectives require sophisticated vision science which 
has not yet been achieved. That need is recognized within 
NASA. The Space and Terrestrial Applications program is 
soliciting proposals for new technology in remote sensing and 
terrain topography. The content of the recommendations of 
this report is that efforts to develop new technology should be 
intensified, and that they should be strengthened by strong 
participation of major research groups outside NASA and by 
cooperation with other organizations with similar needs. The 
balance between research and production systems should be 
evaluated; a heavy research component is essential. Careful 
examination should be made of current and proposed produc- 
tion systems to evaluate whether they are founded on an 
adequate technology base. 


2.2 The State of the Art 

Remote Sensing and Crop Survey. An examination of crop 
census systems reveals that their performance is very limited. 
In summary, those systems do not seem to have met 
expectations of lowered cost and increased repeatability from 
automated classification. In these systems, humans make 
decisions, aided by computer clustering. The overall system 
accuracy is about 90%. Their computerized classification is not 
that good. What humans currently contribute to classification 
is use of spatial context. Both structural pattern recognition 
and scene analysis offer techniques to use spatial context in 
identification. Structural pattern recognition experiments indi- 
cate significant improvements in performance. Our evaluation 
of the mix between development and research indicates that a 


higher proportion of research and more innovative research 
should be supported, and that research results be incorporated 
into development systems continuously, with little lag. It 
appears that a production system was built with obsoIete°and 
inadequate technology. ' 

A likely requirement for the application of structural 
pattern recognition and scene analysis techniques is imagery 
having much higher resolution than LANDSAT. Eighty meters 
resolution is probably too crude to use structural relations. 
High resolution imaging may make use of aerial photography, 
which is part of NASA’s domain. A crop survey using 
structural analysis at high resolution is perhaps feasible now, 
and will be feasible in a few years. A scenario is outlined below 
which would require about 3.4 years to do a world-wide crop 
survey at 10 8 ops/second. The ultimate resolution is about 

2 cm per pixel. Estimates are based on a two-stage analysis. 
For the first stage, a coarse sampling at 2 m/pixel is probably 
adequate. Alternatively, a coarse grid of linear scans would 
require about the same computation cost. The first stage is 
intended to separate major field boundaries. The second stage 
would use structural analysis at 2-cm resolution on limited 
parts of the fields. The use of smart sensors (for example, edge 
operators under development in the DARPA Image Under- 
standing program) would be useful in this program. Smart 
sensors would cut computation cost significantly. 

The Earth’s area is 2 X 10 19 cm 2 . About 1/4 is land and of ' 
that, half is arable. If we sample 10% at 2 m/pixel, there are 
about 5 X 10 12 pixels. Assume about 1000 ops/pixel for 
reasonably sophisticated processing, and 10 8 ops/second. Then 
the required computation time is 5 X 107 seconds. There are 

3 X 10 7 seconds per year. A single computer would require 
1.7 years now. An equal amount of computation would be 
required for the second stage, for a total of 3.4 years. If we 
assume that semiconductors will increase density at the rate 
of a factor of 2 every two years (a factor of 2 per year is 
the current rate) and if we assume a factor of 4 speed increase 
in 5 years (the historical rate), then in about 10 years, a single 
computer will be able to make a world-wide sampling in four 
days. The vision science and software technology should be 
developed now to make use of that computing power. 

Cartography. The production of maps by traditional means 
is labor intensive. Partial automation of elevation contour- 
mapping has been in use for years by DMA, with analog 
stereo correlation systems. It is often thought that automated 
stereo mapping is a solved problem because there are produc- 
tion systems; however, these systems require a great deal of 
human intervention. Typically , they are interactive systems in 
which the operator redirects the system whenever it gets lost 
and patches up errors. There are problems when tracking over 
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A water and over uniform surfaces such as concrete. ”niey do 
- v b 3 dly at surface discontinuities such as edges of buddings and 
- Vi cliffs. In trees, picking out the ground surface is beyond the 
. capability of the system. The extent of human intervention 
required is enough to decrease mapping speed and limit 
mapping output. 


' The DMA has made a major study in automating cartog- 

i raphy in a largely digital system. DMA studies revealed 
. i extensive requirements for advanced techniques in computer 
’■ science with an emphasis on machine intelligence. There is a 
’ ■ : 'v strong relationship of many DMA concerns with related issues 
v 1 in NASA particularly in the area of scene analysis and 
understanding, large database management, and information 
' ; r-j retrieval. 


Research in stereo vision, some of it supported by NASA, 
has produced stereo systems which work in a research 
1 environment and has produced advances in our scientific 
understanding of stereo vision. A model is emerging of the 
' stereo vision process from which newer high performance 
1 systems are being designed and developed. Preliminary 
:J$ research in linear feature tracing has been carried out and the 
j- results indicate that interactive systems using tracing aids are 
feasible for features such as roads and rivers. There is a 
growing body of research on edge finding systems which will 
support development of such aids to linear feature tracing. 

■ Building large data bases for cartographic applications requires 
the integration of research in vision, machine intelligence , and 
general systems issues in computer science. 

Teleoperators. This issue is shared between the Study 
I Group’s vision and robotics subcommittees. This section will 
x.C?' address only the vision part of teleoperator work in space. The 

K;d building of large space structures for communications systems 

.rt; and possible experimental stations appears likely. The cost of 
maintaining a human worker in orbit, including life support 
systems and shielding from radiation, is estimated at S1.5M 
per year. It is hard to assess the difficulty of maintaining a 
crew of highly trained workers in this hazardous environment. 

■ 'l Possible space power stations and space industrialization 
j projects would involve large construction efforts. Development 
' - of teleoperator manipulation offers the possibility of increas- 

•1 ing the productivity of human workers, while lowering their 
’ risk. Operation with large objects, such as the Shuttle- 

Attached Manipulator, imposes another requirement for 
] advanced teleoperator systems, 
i 

j This technology would contribute to electric power genera- 

! tion, to undersea oil drilling and mineral exploitation, and to 
j rehabilitation of disabled people. Recent incidents with power 


shutdowns in nuclear electric power stations have highlighted' 
the technical problems of servicing reactors. Work is currently 
done in a radioactive environment by humans. Advanced 
capabilities for remote operation with man in the loop offer 
opportunities to reduce hazards to workers, lower the cost, 
and increase the level of maintenance. On-line monitoring and 
maintenance are other possibilities. A high payoff is expected 
for a partially automated system. In this type of system, the 
teleoperator system takes over a set of limited operations, 
using sensing and knowledge of parts. Once the operator has 
positioned the manipulator to approximately the right orienta- 
tion, the system completes the action itself. The payoff is in 
speed and ease for the operator. 

Technical requirements for this application require the 
development of manipulator hardware, control systems, soft- 
ware, and sensor systems, in addition to a vision system. The 
vision system required for the simplest of teleoperator systems 
needs the ability to present multiple views, and could benefit 
from stereo if satisfactory stereo systems can be developed. 
For partially automated systems, stereo vision and the use of 
multiple views are highly important. Even when the views are 
separate (i.e., wide angle views which cannot be fused), the 
sort of modeling which is involved in stereo vision is important 
for autonomous vision in these contexts. Considerable use can 
be made of knowledge of the design of parts and joints, for 
model-based vision systems. 

Mars Rover. A proposed Mars rover mission requires 
considerable onboard autonomy if one expects to achieve the 
objectives of a few hundred meters navigation per day, with 
communication for a short time once per day and round trip 
signal times of twenty-five minutes. The minimal navigation 
device is a laser ranging device. Its two limitations are limited 
range and limited number of samples. These limitations put 
restrictions on its reliability and utility since such a sensor can 
do little in looking for interesting samples. Navigation using 
only this device can be only local, with little look-ahead and 
low resolution. Under these conditions, the rover is likely at 
some time to reach a dead end that it can’t back out of, or 
waste excessive time in getting out of, because of limited 
search strategy options. 

NASA does sponsor some research in stereo vision. This is 
on a small scale and should be expanded. Functionally, stereo 
vision with motion parallax offers capabilities to maintain 
orientation by navigating with respect to landmarks, and to 
allow depth ranging and maping of distant objects by making 
use of large baselines accumulated in motion. It is thus 
possible for the rover to avoid problems and to return to base 
locations. 
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2.3 Recommendations 

We recommend evaluation of NASA participation in the 
development of advanced automation in cartography for 
civilian purposes. Cartography and land use studies appear to 
be important applications areas. Progress in computer stereo 
vision makes possible major advances in cartography. The 
civilian organization with responsibility in this area, USGS, has 
limited facilities and limited research. Because of the strong 
relationship of the Defense Mapping Agency Pilot Digital 
Operations Project with NASA interests, it is recommended 
that NASA maintain strong liaison with the DMA and 
investigate possible collaboration with their efforts. NASA 
should evaluate the DMA planning process to aid in costing the 
development of detailed plans for implementing some of the 
related suggestions of this Study Group. A collaborative 
research program with DMA and USGS would have high 
potential benefit, and would be strengthened by research 
underway in DOD, particularly for cruise missile guidance. 

We recommend the support of research in computer stereo 
vision for teleoperators intended for remote construction and 
maintenance of large space structures for communication 
facilities in space. Antennas and communication systems in 
space appear to have economic benefits in a reasonable time 
scale. We recommend that a small investment be made which 
would increase productivity of remote operations as the cost 
per man-hour in space will be high. Advanced teleoperator 
technology would lower exposure of human workers in the 
radiation belts and increase their effectiveness. The technology 
would be equally useful for large space structures for space 
power stations or space industrialization should NASA 
undertake them. 

We recommend that NASA increase support of computer 
stereo vision for a proposed Mars rover mission. Current 
progress in stereo vision promises improved capabilities and 
increased scientific payoff. 

We recommend that agricultural remote survey applications 
be reevaluated. It is urged that performance limitations of the 
current technology be evaluated. NASA should study the 
feasibility of using more powerful structural pattern recogni- 
tion and scene analysis approaches, and that systems be built 
which incorporate new technology. Crude estimates indicate 
that high resolution structural analyses may be feasible soon 
for crop census. 

We recommend that NASA support basic research in 
structural pattern recognition and scene analysis approaches. 

We recommend that NASA diversify its research base in 
imaging research, that it evaluate the proportion of research to 


development investment. It is suggested that NASA support 
research at centers of excellence in computer vision. Tills 
approach is cost-effective since it is not necessary to support 
whole programs; these centers have broad support and 
well-established programs. This approach provides a means of 
collaboration with related research programs. It is suggested^ - 
that the emphasis be on innovative focused research, not on 
applied research. It is recommended that a vigorous program 
of evaluation by members of the research community be used 
for program formulation and proposal review, and that they be 
involved in a strong periodic program monitoring effort. 
NASA is involved in the forefront of computer vision since its 
intended applications probably are not feasible by old tech- 
nology. Yet, NASA does not have a broad enough base of 
imaging science within its organization. A significant part of 
NASA vision effort should be outside of NASA-related 
centers. It is recommended that hardware development work 
on smart sensors and image processing computers be carried 
out in collaboration with DOD and with broad contact with 
the research community. The NEEDS program represents a 
step toward a systems approach to providing data to users. 
There is a need for a program which integrates this data system 
with the information processing that users actually perform on 
the data. 

3. Missions Operations Technology' 

This section discusses NASA’s current mission operations - 
and attempts to identify several areas in which machine 
intelligence can be brought to bear to increase the automation 
of control operations and to replace humans in time-critical, 
repetitive, and routine decision-making roles. A proposal to 
automate the mission-independent aspects of data collection 
and to provide a uniform facility for embedding mission- 
specific systems in the basic support system is reviewed. 

3,1 Introduction 

NASA currently builds and rebuilds mission-specific soft- 
ware for each mission’s control. Although this state of affairs 
reflects the natural evolution of NASA as a large complex 
organization, there are indications that, without immediate 
and global reorganization of the mission control procedure^, 
both NASA’s science and economy will begin to suffer.* 
Specifically, the Study Group sees a pervasive need to 
centralize and standardize mission operations procedures. In 
this regard, the Study Group sees a clear need for the 
development of a modular, “reusable” nucleus of mission 
operations software. 

The scope of the standardization and centralization should 
include all aspects of mission control, from the lowest levels of 
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j 

i sequencing and monitoring to the highest levels of planning 
- and problem solving. Current problems at the lower levels 
.. ..J relate not so much to lack of mechanization as they do to lack 
; 0 f organization of the existing mechanization. Hence, cleaning 
up the lower levels calls for improved software development 
• - T 1 * and integration techniques. On the other hand, establishing 
H-]i procedures and capabilities to organize and extend the 
effectiveness of the higher levels of mission control seems to 
l? 1 * * call for the infusion of AI techniques; the goals at the higher 

■ levels would be to increase the automaticity of mission 
- : 'i control, replacing humans in time-critical, repetitive, and 

.7 •? routine decision-making roles. 

•V j All indications are that NASA is in immediate need of a 
' ’ : 1 more centralized, modular, and automated mission control 
4-4 center concept. This need for a reusable, centralized mission 
7 control center has already been recognized by certain groups 
/; within NASA. Des Jardains’ POCCNET concept, reviewed 
. 7 below, provides an excellent overview of how mission opera- 
7 ^ tions could be cleaned up and standardized at the lower levels, 
j providing a modular software foundation into which the 
7 specific scientific and technological needs of each mission 

L could be grafted. At the higher levels, there are some AI 

methods that the Study Group feels are ready for immediate 
1 technology transfer, and others that NASA should invest in for 

■ longer term payoffs. We suggest several of the immediate and 

> eventual payoffs from AI in mission operations below, and 

J have included a brief survey of the state of the art in AI 

7 ; j problem solving and programming languages. 

• I 

3.2 State of the Art: Mission Operations 

; Xhe view of mission operations developed by the Study 
.7 Group is that there are three categories of human activity in 

• mission control during a mission s lifetime. 

; ,.'7 1 _ Intimate control activities, where human intelligence and 

expertise seem to be demanded. 

v < 2. Mid-level intelligence problem solving tasks (real-time 

•J flight sequencing, resource scheduling, automatic con- 

fii C t resolution) where humans are extensively employed 
;.7 because of their problem solving and modeling knowl- 

'' f f edge, but where no judgmental decisions per se must be 

7 ! made. 

■irU 

3 . Repetitive monitoring and control activities, where 
enough intelligence and human intervention is required 
7 that humans are presently essential, yet where the tasks 

are unchallenging and wasteful of human resources. 

In this subsection, we highlight what seem to be the most 
important aspects of mission operations from categories 2 and 


3 that might be made more reliable, rapid, or economical if 
partially or fully automated via existing AI techniques. 

Current Missions Operations. Mission operations is the 
control executive for a mission. As such it comprises the 
following specific activities: 

1 . Telemetry and Command - gathering the data transmit- 
ted from the payload, deframing and demultiplexing it, 
reconstructing the original raw telemetry frames, storing 
it, and transmitting commands and/or data to the 
payload. 

2. Payload Navigation - determining actual payload orbital 
parameters, comparing them with nominal values, and 
making minor orbital corrections. 

3. Monitoring - interpreting received data to ensure 
integrity of the craft and performing preventative and 
diagnostic tests and maneuvers. 

4. Sequencing - devising, coding, and transmitting se- 
quencing instructions to the craft for executing science 
experiments and remedying problems. 

5. Data Interpretation and Display — capturing, formatting, 
and displaying scientific and technological data from the 
craft for convenient use by humans. 

6 . Manpower Coordination - sequencing ground-based hu- 
man activities relating to the successful monitoring and 
science gathering of the mission; this includes such things 
as gathering together appropriate subsets of the scientific 
community for judgmental decisions, coordinating rou- 
tine staffing of the monitoring facilities, locating techni- 
cal experts to deal with specific problems, and so forth. 

Although the Study Group saw a wide spectrum of detail 
across the various projects and missions within NASA, every 
project and mission seems to demand these core activities. 
Indeed, it appears that only a small fraction of a mission s cost 
in manpower and planning derives from the unique scientific 
aspects of the mission; without a doubt, the bulk of missions 
operations is common to all projects within NASA. 

Nearly everyone in NASA seems to realize this. Yet there 
seems to be such great inertia from NASA s early days of rapid 
growth that no one seems able to initiate cross-mission 
technologies that would coalesce missions operations. We saw 
one notable exception, however; des Jardains’ proposal for an 
automated, reusable Missions Control Center (POCCNET- 
RTOP #310-40-40). Des Jardains’ proposal is well-conceived; 
but, as he points out, even the most ambitious automation 
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projects can profit from the use of AI technologies. Since we 
feel des Jardains’ proposal represents a large step in the right 
direction, and since we have a relatively clear picture of where 
the incorporation of AI techniques might significantly enhance 
des Jardains proposal, we first summarize his ideas; then 
suggest how the concept can be extended by incorporating 
existing AI problem solving and representation technologies. 

Des Jardains’ Proposal. Des Jardains’ proposal concentrates 
primarily on the concept of a reusable missions control system 
which automates major portions of several of the categories of 
mission operations above. He thinks in terms of a “payload 
operations cycle” in which requests for data or science from 
users are queued up and scheduled by missions planning, 
taking into account their priorities, sequencing demands, and 
the current state of the craft and its sensors. The output of the 
missions planner is an “as-planned payload activity timeline,” 
which, when combined with parametric data indicating the 
craft s current state, yields a specific sequence of commands to 
the craft. Results of commands yield an “as-performed time 
line, which reports back to the data management phase of the 
cycle. This phase organizes raw data collected during specific 
intervals, correlates them with the as-performed time line, and 
with original user requests, then delivers the data to the user. 
An intelligent data management facility would also notice 
missing data and unfilled user requests, and act as a sort of 
ombudsman for all users, following up with its own requests to 
mission planning to fill unmet original user requests. 

Des Jardains’ proposal is essentially (1) to automate the 
mission-independent aspects of this data collection and deliv- 
ery cycle (and its implicit sequencing) and (2) to provide a 
uniform facility for embedding mission-specific systems in the 
basic support system. Since the sources of information with 
which the system must deal are both technically and geograph- 
ically diverse, the proposal takes the form of a computer 
network which des Jardains calls the payload operations 
computing cycle (POCC) net. 

As des Jardains correctly points out, such a POCC net 
would solve a number of NASA’s current problems relating to 
mission cost, turnaround time, and efficiency. Currently, in 
the absence of a uniform, reusable facility, each mission must 
develop its own special purpose systems which are not reliable 
and which often just barely work. Users often must suffer 
lengthy delays, and must pay individually for computer time 
that relates more to NASA mission management than to the 
science the user derives. Original mission support teams break 
up and leave, taking with them much of the esoteric mission 
specific knowledge, making it difficult to train new staff to 
support the mission for the duration of its lifetime. In short, 
time, money, and effort are wasted by not having a central' 


facility that serves as a large, automated backdrop of uniform 
computing resources useful to any specific mission. 

AI Techniques: Mission Monitoring. The volumes of para- 
metric data sent back from a craft are sampled, abstracted, and 
formatted for meaningful interface with human controllcrs.- 
The role of a controller is to apply a knowledge of the" 
mission s goals, the craft’s capabilities and physics, and the 
current-phase phase of the mission in interpreting the data-he 
sees. Our impression has been that this aspect of mission 
operations remains essentially unautomated, except possibly 
for continually improving sampling techniques, display tech- 
nologies, and human interfaces. Our message to NASA is that 
this is an ideal area for increased automation from AI. 

The key to automating this aspect of missions operations 
lies in the areas of symbolic modeling and representation, two 
of the pivotal areas of AI. Presently, the human’s presence in 
the monitoring loop is required simply to make connections 
between what the human’s symbolic model of the mission and 
craft say should be happening at any moment, and what is 
actually happening. In this role, the human monitor draws 
primarily upon his knowledge of cause-effect relationship, 
ones which are specific to the craft and others which are of a 
more generic nature. Because of what he knows about the 
current phase of the mission, he is able to compare the 
incoming parametric data with the expected conditions, 
supplied by his model. When anomalies arise, he can not only 
recognize them, but also use them in combination with his 
symbolic model to hypothesize the nature of the fault. He 
could then issue further diagnostic commands to the craft, 
commands that would remedy the fault, or commands to 
reconfigure and bypass it. 

Such symbolic modeling, including representation of the 
craft, representation of cause-effect principles, symbolic simu- 
lation, and fault modeling and diagnosis, are favorite AI topics. 
Much of the best AI research has been carried out in these 
areas, and the Study Group feels that parts of this science are 
ready for transfer into NASA. 

AI Techniques: Sequencing and Control. The Study Group 
heard reports of the agonizingly slow methods of controlling 
Viking. The process of conceiving, coding, verifying, and 
transmitting commands to the arm and science packages 
aboard Viking apparently took times measured in weeks, even 
for relatively modest operations. The Study Group appreciated 
the uniqueness of the first missions, and concurred that the 
procedures used were essential, given the importance and 
novelty of the Viking missions. However, as NASA proceeds 
with increasingly complex scientific missions, the increasing 
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autonomy of craft will demand far more automated sequence 
control regimes, both on the ground and on the craft. This 
appears to be another topic closely fitting current AI work. 

The sequencing task appears to progress as follows. A 
committee of scientists convenes and decides on some imme- 
diate science goals. These are then roughly mapped onto craft 
capabilities, with some preliminary consideration that the 
goals are feasible, consistent with one another, and so forth. A 
team of experts is given the general goals; then produces a 
general sequencing plan. The general plan is progressively 
mapped down to the individual command level, resulting in a 
sequence of primitive steps to be sent to the craft. Before it is 
sent, however, the sequence must be verified both manually 
and by computer simulation to (a) meet the science goals and 
(b) to preserve craft integrity in all respects (electrical, 
mechanical, thermal, logical). After the sequence has been 
scrutinized, it is sent a step at a time, with very careful 
attention to feedback from the craft to ensure successful 
completion of each step before proceeding to the next. In a 
mission with the relatively simple arm and TV facilities of 
Viking, the bottlenecks seem to be the code sequence 
verification step and the feedback loop in which the sequence 
is administered to the craft. The conception of plans, and their 
mapping onto craft capabilities do not appear to be the 
bottlenecks. However, in a more complex mission all phases of 
sequencing will be bottlenecks, if attempted using the same 
level of control techniques found in Viking. 

One of the larger areas of AI, problem solving, is directly 
relevant to all phases of mission sequencing. This is the study 
of the logical structure of plans, and their automatic genera- 
tion for complex sequencing tasks. The Study Group is again 
unanimous in its opinion that AI problem solving theory is 
largely ready for use by NASA in complex sequencing 
environments, both ground-based and on semi-autonomous 
craft. Putting more sequencing intelligence on the craft 
becomes increasingly attractive as ground-craft distances in- 
crease and effective communication bandwidth decreases.. 

The scenario of a semi-autonomous craft with onboard 
problem solving intelligence and a symbolic model of its own 
capabilities might go as follows. Scientists decide that a sample 
of reddish material spotted about 15 meters away should be 
analyzed by science package 21. Using graphics techniques, 
they draw an outline around the sample on the TV image. 
Using this outline to identify the object of interest, the 
onboard vision system converts the image data to coordinate 
data in its local coordinate frame. The vision system issues the 
goal of causing a piece of the sample located at the coordinate 
to be transported to the input hopper of science package 2 1 , 
located at another known position. The navigation problem 
solver then generates a course, moves the craft to within arm’s 


distance of the sample, reaches, grasps, then verifies visually 
and by tactile feedback that a red mass exists in its grasper. It 
then plans an arm trajectory to package 21’s input hopper, 
noting that the flap of package 13 is up, and must be avoided. 
After moving the sample to the hopper and ungrasping, it 
visually verifies that a red mass exists in the hopper, and no 
longer exists in the grasper. It turns on package 21, and reports 
back to ground. 

Everything in this scenario is within the means of current or 
forseeable AI problem solving, manipulator, vision, and naviga- 
tion technology. Its primary feature is that, because of a 
self-model and knowledge of problem solving strategies, the 
craft can do more science with less ground-based support in a 
given period of time. Furthermore, the advantages of such 
technology on any particular mission are miniscule when 
compared to the advantages NASA will derive from the 
underlying technology. Again, just as des Jardains has pointed 
out for the lower level aspects of mission operations, what 
NASA sorely needs is a mission independent repertoire of 
basic problem solving packages which can be molded around 
the automatic sequencing needs of each mission in a uniform 
way. 

3.3 Recommendations 

Up to this point, NASA has concentrated on those activities 
that, in a primary sense, result in successful missions. That is, 
NASA designs and builds the equipment required for space- 
related science. This includes ground-based control equipment 
and procedures, as well as the spacecraft and its support 
systems. The Study Group strongly feels it is essential that 
NASA begin to look at some metaissues of how to codify the 
knowledge it uses in primary development. AI research has 
shown that codification of the knowledge underlying the 
primary advances in a field can lead to a better understanding 
of the basic issues of the field. In NASA’s case, the immediate 
and long-term payoffs from codification of existing knowledge 
about mission operations would be in increased automaticity, 
if the primary technologies underlying mission operations can 
then be handed over to the computer. As the computer 
assumes progressively more of the intelligent control func- 
tions, more ambitious missions become possible, each mission 
becomes cheaper, and the scientific community can be put in 
closer touch with the onboard science. 

The Study Group’s message to NASA is, therefore, that 
NASA is becomming more and more an information utility 
and less and less a space hardware enterprise. Because of this, 
NASA needs to begin new mission independent programs for 
managing information during a mission. The first step toward 
creating a metalevel (information-based, rather than hardware- 
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based) technology within NASA is the development of a 
unified Mission Control Center, with the goal of increasing the 
mechanization and standardization of sequencing, data han- 
dling and delivery, and related protocols at the low levels of 
the system, and increasing the automaticity of the center at 
the higher levels by introduction of existing AI problem 
solving and symbolic modeling techniques. 

To begin the development of such a reusable, modular, 
intelligent Mission Control Center, the Study Group makes the 
following recommendations. 

1. That NASA look seriously at des Jardains’ proposal and 
establish a mission-independent fund for supporting the 
development of a system such as DesJardains proposes. 

2. That NASA create a special internal, cross-mission 
division whose primary charge is to interact with the AI 
community on issues of increased automaticity, using AI 
techniques, throughout NASA mission operations. The 
division would serve as a membrane through which 
theoretical AI and advanced computer science could 
flow into NASA to meet practical mission operations 
needs. The division would eventually become a mission- 
independent resource from which the mission planners 
for individual missions could draw advanced control 
techniques for their specific goals. 

3. That NASA charge the new division with constructing 
symbolic models of mission operation, and applying 
those models in the organization of an intelligent soft- 
ware library for use in specific missions. This library 
would provide basic AI technological support for auto- 
mating various aspects of specific missions. It would 
serve much the same function as a machine shop now 
serves; but rather than new experimental hardware, it 
would draw upon advanced AI and computer science to 
provide mission-specific software tools, ranging from 
symbolic models of a spacecraft to models of the 
scientific uses of information derived from the craft. 

4. That NASA adopt and support one of the advanced AI 
programming languages (and related research machinery) 
for use by the AI division in its role as a NASA-wide 
advanced technique resource and information facility. 

4. Spacecraft Computer Technology 

The intent of this section is to discuss computer require- 
ments for onboard spacecraft operations in future NASA 
missions. Space missions have special computer needs that do 
not pertain in ground use of computers. The special needs and 


requirements that will be imposed on computers to meet 
scientific missions of exploratory space flights in the areas of 
fault tolerance, large scale integrated circuits, space qualifica- 
tion of computers, computer architectures, and research 
needed for space computers are discussed. Recommendations 
of actions to be taken by NASA are specified for each of these' 
areas. 

4.1 Technological Need 

Computers in outer space face severe architectural con- 
straints that do not exist with respect to ground-based 
computer operation. Because of this, special considerations 
must be taken with space computers that do not necessarily 
generalize from ground experience. The aspects that require 
special attention are discussed below. 

1. Power and weight constraints are important for space 
missions. Fortunately, work in large scale integrated 
(LSI) technology has played a major role in decreasing 
power and weight requirements for computers. 

2. Hostile space environmental conditions require that the- 
computer be shielded from radiation, extreme tempera- 
tures, mechanical stress, and other space conditions. 

Operational Requirements 

1. Component reliability is essential since manual repair or 
maintenance in the conventional sense is not possible. 

2. Autonomous operation of the computer is essential as 
there will be limited communications with Earth-based 
systems. 

3. Computers must be both electronically and logically 
fault tolerant to: 

— Provide long operational life. 

— Provide self-contained recovery from transient and 
permanent faults. 

— Control automatic maintenance of the entire space- ~ 
craft. Error conditions must be readily detectable and 
isolated to permit recovery operations. Errors may be 
of two major varieties. 

— Physical faults due to component failures, temporary 
malfunctions, and external interference. 

— Man-made faults due to imperfections in the specifi- 
cations and bugs in the program. 
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Scientific Needs 

The scientific needs for space mission computers may vary 
greatly. Once a mission is approved and the science objectives 
are specified, it is necessary to analyze each scientific 
experiment to determine its needs for computation. Because 
of the development of microcomputer technology it is not 
unreasonable to place a small microcomputer into a scientific 
device to provide it with more intelligence. Hence, there will 
be a need for microprocessors. 

To support devices which will be used to explore a celestial 
body, and which will exhibit “intelligent” behavior, large-scale 
computers will be necessary; that is, large, fast primary 
memory storage and backup storage devices will be required. 
Processing pictures, and developing detailed plans to permit 
robotic devices to operate in space so as to accomplish mission 
objectives given general guidance from ground, will be neces- 
sary. Large amounts of space and time are required to process 
real-time programs for robotics and machine intelligence. 

4.2 State of the Art: Architectural 
Alternatives for Space Operations 

The use of computers for space missions has been evolving 
since the start of the space age. First-generation space missions 
essentially had no computers. Second-generation missions had 
centralized computers that performed all computations re- 
quired by the mission. Third-generation computers are now 
being considered. Three different computer architectures can 
be considered for space operations: distributed microcomput- 
ers, centralized processor, and distributed networks of com- 
puters. Some of the advantages and disadvantages of each 
approach will be explored below. 

Distributed Microcomputers. If one is to have many small 
devices with their own built-in intelligence via a microproces- 
sor, then a distributed microcomputer configuration is highly 
desirable. Such a concept has many advantages both from a 
technological view and a management view. A distributed 
network should permit any microprocessor qualified for space 
to be interconnected to the system. The interface between 
modules should be simple as the devices should be relatively 
independent of one another. Hence, errors can be isolated to 
devices, and it should simplify design problems. A simple 
executive routine could be developed to control the devices. 

There are some virtues to a distributed microcomputer 
approach: 

1 . Changes in software sent from the ground to enhance a 
device need not pass through extensive reviews as the 
change affects only one experiment. Hence, coordina- 


tion between experimenters and the various software 
will not, in general, be necessary. 

2. Software needs to be developed primarily for small 
problems. The code will be short, and in most instances, 
will be written by one programmer. Hence, software can 
be verified and tested more readily than can large, 
complex software. 

Some disadvantages of a distributed approach are: 

1 . Space, weight, and computer memory requirements may 
be larger than that ’for a centralized approach since 
memory and logic is not being shared. 

2. “Intelligent devices” that have their own microproces- 
sors cannot obtain more memory than initially planned 
for the space mission. There may be instances whereby 
information learned on the ground could cause new 
software to be developed for the device. However, unless 
the new software fits into the preplanned memory size, 
it will not be possible to make the change. 

Centralized Processor. In a centralized processor system, all 
functions relative to “intelligent” devices are placed in one 
computing system. Devices may time-share the central proces- 
sor so as to have the same effect of “intelligence” as with a 
distributed processor system in which the “intelligence is 
built into the device with a small microprocessor. A central- 
ized processor would have a dynamic storage allocation 
routine built into it to account for space required by separate 
programs. 

Some of the virtues of a centralized processor configuration 
are: 

1. Large, fast memories become available for complex 
“machine intelligence” tasks such as picture processing, 
high resolution, and plan formation needed to permit 
robotic devices to explore terrestrial bodies in space. 

2. “Intelligent devices” that time-share the central proces- 
sor can have their “intelligence” augmented by new 
software since more core memory should be readily 
acquired from the dynamic storage allocation routine if 
needed. 

3. Space and weight is saved since only one control logic is 
required for the single computer, and memory is shared. 

Some disadvantages are: 

1. The executive routine for the central processor will be 
complicated and verification of the executive routine 


0 


37 


will be more complex than for the distributed processor 
approach. 

2. Changes in software made on the ground to enhance a 
device may require extensive coordination and testing on 
the ground before it can be approved and transmitted to 
the spacecraft. 

Distributed Networks of Computers. In a distributed 
network of computers, tasks to be performed can be assigned 
to any of the computers in the network. Peripheral devices and 
memory in each of the processors tan be shared. Many central 
processors permit parallel computing to take place. A virtue of 
such an approach is that if one central processor fails, 
computation can still continue since other processors can be 

used to perform the work, albeit at a reduced processing 
speed. 

Some disadvantages of the approach are: 

1. Complex executive routines are required to control and 
to transfer data between processors. 

2. A considerable amount of time may be expended to 
simply manage the configuration than in performing 
work in support of the scientific mission of the flight. 

Fault Tolerance. Computers sent into space must be robust. 
They must be able to operate in space even when malfunctions 
occur. Fault tolerance is an attribute of information processing 
systems that enables the continuation of expected system 
behavior after faults occur. Fault tolerance is essential to space 
missions as it is impossible to adequately test components of 
transistor-like devices on a single chip. A single computer 
would have hundreds of such chips. 

Faults fall primarily into two fundamentally distinct 
classes: 

- Physical faults caused by adverse natural phenomena, 
component failures, and external interference originating 
in the environment. 

— Man-made faults caused by human errors including 
imperfections in specifications, design errors, implemen- 
tation errors, and erroneous man/machine interactions. 

Fault tolerance and fault-avoidance are complementary ap- 
proaches to the fault problem. Fault avoidance attempts to 
attain reliable systems by: 

— Acquisition of the most reliable components and their 
testing under various conditions. 


Use of thoroughly refined techniques for the intercon- 
nections of components and assembly of subsystems. 

— Packaging and shielding of the hardware to screen out. 
expected forms of external interference. 

- Carrying out of extensive testing of the complete system 
prior to its use. 

Fault tolerance of physical faults attempts to employ 
protective redundancy, which becomes effective when faults 
occur. Several redundancy techniques are: 

- Fault masking to assure that the effect of a fault is 
isolated to a single module. 

- Fault detection to detect that an error has occurred so 
that a recovery algorithm may be initiated. 

- Fault recovery to correct a detected fault. Automatic 
recovery algorithms are essential for space flights since 
human intervention will not be possible. 

Fault masking appears to be a good approach primarily for 
short missions that consist of several days duration. Both 
hardware and software controlled recovery systems are re^ 
quired for successful space operations. 

Two techniques for realizing fault tolerance of man-made- 
faults are: 

— Design faults: prove correctness of programs and mathe- 
matical models for software reliability and prediction 
(both are in the research stage); “software engineering” 
techniques include procedures for the collection and 
analysis of fault data; management procedures for 
software development; structures programming approach 
, to program design; and software verification and valida- 
tion techniques. 

- Interaction faults due to man/machine interaction errors. 
Control of such faults has been implemented primarily 
by operator training and maintenance manuals. Tech- 
niques used in AI could suggest approaches that would’ 
eliminate this kind of fault by screening all inputs. 

LSI Technology. Large scale integrated circuit technology - 
has yielded relatively large processors on small chips. These 
devices are highly important for space technology. Today’s 
high performance MOS microprocessor has the following 
features: 

— Architecture — 16 bit minicomputer on one chip. 
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- Cycle Time — 125 nanosecond operation speed. 

— Power — 1.0 watt. 

— Die Size — 5.25 millimeters on a side. 

It is not clear, however, that such a fast device could be space 
certified in the near future. 

Future high performance MOS microprocessors are likely 
to have the following features: 

— Architecture — Full scale information processing system. 

- Cycle Time - <100 nanoseconds. (Such a speed may not 
be achieved in the near future and may require an even 
longer time to be space qualified.) 

— Power — 2-4 watts. 

— Die Size — 6.5 millimeters in a side. 

— Device Count — 60,000. 

In addition, it would have a large logical address space, 
multiprocessing capability, a language orientation, and a 
firmware operating system. 

Space - Qualified Computers. Space qualified computers 
appear to be lagging significantly behind ground-based 
computers both in speed and memory capacity. Specifications 
for a fault-tolerant space computer (FTSC) under development 
at the Raytheon Corporation are as follows: 

— Operations/second — 250,000. 

- Word and Memory* Size - 32 bit words up to 60 K 
memory. 

— Operations — floating point and vector operations. 

- Weight — 23 kg for a 60 K memory and 36 K spares 14 g 
for 16 K memory and 12 K spares. 

— Power — 25 watts. 

The system is expected to be triply redundant, where all 
modules are on single chips. 

4.3 Recommendations 

Digital computers onboard spacecraft have been playing an 
ever increasing role in NASA space missions. They are destined 
to play a dominant role in future space missions. The 


miniaturization of computers that has revolutionized com- 
puters on Earth provides even greater opportunities for space 
missions. They will permit NASA to develop “intelligent” 
sensors and devices which permit information, rather than raw 
data to be acquired in space and be sent to Earth. Significant 
size computers can be developed which will permit robotic 
devices to be built and controlled using general plans devel- 
oped on Earth. Such devices will permit the terrestrial 
exploration of remote bodies that cannot be explored by man. 

Fault Tolerance and Hardware. Whereas the development of 
smaller, more powerful computers on chips will progress 
without support from NASA, these developments will not 
meet NASA needs for spacecraft. Ground computers do not 
require absolute fault tolerance. Because they are relatively 
inexpensive, chips can be replaced on the ground. This, 
however, is not possible onboard spacecraft, where fault- 
tolerance is crucial to the success of a mission. Fault-tolerant 
hardware systems need to be supported both by NASA and 
the Department of Defense who are also concerned with 
computers onboard spacecraft. If funding were coordinated, it 
could benefit both organizations. Fault tolerance must pro- 
ceed at two levels - considering both hardware and software. 
At the current time, a major problem exists with respect to 
large scale integrated circuit technology. Because of their 
complexity, chips cannot be tested adequately now. Random 
logic chips (e.g., INTEL 8080) may have failure rates that are 
unacceptable for space use. The random logic makes it 
extremely difficult to test chips adequately. 

A hierarchic, or top-down, approach to designing chips, 
rather than random design methods could increase chip 
reliability and permit easier testing. NASA should support 
efforts in hierarchic design, or other design techniques which 
will improve chip reliability and ease of- testing. Until major 
developments are made by manufacturers in improving the 
reliability and testing of chips, NASA should plan to test its 
own wafers thoroughly before qualifying them for space. 
Testing performed by manufacturers on wafers has been, at 
best, poor. Planning for fault tolerant hardware must start at 
the inception of a space mission and must be a part of the 
mission management plan. 

Fault Tolerance and Software. Fault tolerance is needed 
not only for hardware, but also for software. Because of a 
trivial software error, an entire space mission costing billions 
of dollars can be lost. By having intelligent devices with their 
own hardware and software, small programs, relatively easy to 
code, verify, and test can be developed. Howeyer, one cannot 
always guarantee small programs. Hence, a fault tolerant and 
software effort must be initiated at the inception of a mission 
and must be an integral part of the management plan. 
Software recovery procedures and algorithms to handle single 


39 


and multiple failures are required, and need considerable 
research. A systematic effort is needed for error detection and 
recovery algorithms for space computers. Fault-tolerant hard- 
ware and software for space computers is still in its infancy 
and needs considerable support from NASA. 

Computer Architecture. There is no one computer architec- 
ture uniquely suited to all NASA’s needs. The particular 
architecture for a specific mission will depend upon the 
mission objectives. The three architectures discussed in Subsec- 
tion 4.2, all have advantages and disadvantages. The distrib- 
uted processor concept and large central processors are useful 
architectures and should be considered for near and future 
term space missions. However, the distributed network of 
computers requires considerably more research to determine 
its applicability to space operations. Because much is still not 
known about the control of distributed networks on ground- 
based systems, this type of architecture is not realistic for a 
Mars 1986 flight which would include a robotic device. A 
distributed processor concept is attractive from a management 
I view of space computing. It provides for separation of 
functions. It is particularly useful for missions on which 
intelligent devices and sensors have special timing require- 
ments that cannot be fulfilled by a central processor. 

Missions that require robotic devices will require large 
central processors. Because of weight and space limitations, 
“intelligent” devices should be reviewed carefully on such 
missions to determine if their needs could be met by the 
central processor. If this is possible, then the central processor 
should be shared to service the device. Trade-off studies will be 
needed to determine the role of the central processor and the 
number of “intelligent” devices that will meet the space- 
weight restrictions. Computers are destined to play essential 
roles in space missions. They will have a major impact on 
“intelligent” devices and sensors. Exploration of terrestrial 
bodies by robots can be possible only with adequate computer 
hardware and software. NASA must place greater stress and 
funds into the support of space-board computers, fault 
tolerant techniques and systems, and software support for 
future space missions. 

5. Computer Systems Technology 

This section addresses the use of computer technology 
throughout NASA. We will review the rapidly evolving state of 
hardware technology and describe its implications upon the 
practicality of machine intelligence. 

5.1 Introduction 

With computer technology so central to the organizations’s 
mission, and consuming such a large percentage of its 


resources, one would expect to find, a massive research and 
development program to advance this technology and thereby 
further its mission objectives. Yet we have found scant 
evidence of NASA innovation within this field, and strong, 
indications that it is not even adequately adopting technology 
developed elsewhere. As an indication of this lack of innova- 
tion, though it is certainly not conclusive evidence, at the most 
recent AIAA Computers in Aerospace Conference (1977) 
sponsored in part by NASA, only four of the eighty-six papers 
presented (less than 5%) were by NASA Headquarters or 
NASA centers people. 

5.2 State of the Art: Computer Systems 

In the workshop deliberations of this Study Group several 
trends within NASA have become quite apparent which may 
seriously proscribe the potential benefits available from 
spacecraft based machine intelligence. It is therefore important 
to identify these trends, uncover their basic cause, and suggest 
alternative cures which preserve and enhance the opportunities 
to utilize machine intelligence. These same trends also exist, 
though to a lesser extent, for ground-based systems, and hence 
have broad applicability throughout the agency. 

NASA Missions Are Engineered and Preplanned to Minimize 
Dependence on Autonomous Operations. Because of NASA’s 
no-fail philosophy for missions, an extremely conservative 
force is applied to mission plans and objectives. All aspects of 
the mission are carefully thought out in minute detail and all 
interactions between components meticulously accounted for. 
Besides increasing mission planning costs and lead time, the 
resulting plans are extremely inflexible and are incapable of 
having experimental components. As an example of this 
approach, the Mars rover mission reduced * the need for 
autonomous control to local obstacle avoidance within a 
30-meter path. The rest of the control was provided via ground 
supplied sequencing produced on an overnight basis. As a 
result half of the available picture bandwidth was devoted to 
pictures for the ground based path planning function rather 
than for science content, and no autonomous capability was 
provided to photograph and/or analyze targets of opportunity 
not selected in the ground-based plan. Similarly, in ground- 
based systems, we found evidence that investigators working in- 
data reduction were not able to use the most advanced 
technology available because the NASA monitors were not 
convinced that it was 100% reliable. Instead, a proven, but 
obsolete, method requiring much more user intervention was 
chosen because of NASA excessive conservatism and because 
the concept of experimental upgrade of baseline capabilities 
has not been embraced. Clearly what is needed is a new 
mission planning model which establishes minimum baseline 
capabilities for mission components, enables use of enhanced 
versions, and provides protection from component malfunc- 
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tion with automatic reversion to baseline capabilities. While 
\ such redundancy is commonplace in hardware, similar soft- 
ware redundancy, especially utilizing enhanced “experimen- 
’ tal” versions is quite novel, but technically feasible within a 
properly constituted operating system. Developing such a 
* capability is part of our recommendations below. 

Q Increased Use of Distributed Computations. There appears 
to be a strong push for correlating software function with 
hardware modules, so that software separation is paralleled by 
hardware separation. This tendency seems to be predicated on 
■ : the current inability to separate and protect software modules 

from one another except by placing them in separate hardware 
•** units. 

. - * ^ 

- ‘ The cost of this practice is to preallocate computing 

resources on a fixed basis rather than dynamically allocate 
■. ' them from a common pool. This results in underutilization of 
the computing resource, reduced capability, and/or decreased 
system flexibility. Current machine intelligence systems re- 
ivi: quire large allocations of resources, but they only utilize them 

. ; intermittently. Since such machine intelligence systems will 
• - initially be experimental they are less likely to justify fixed 

allocation of the resources only occasionally required. 

. ' L The benefits of increased utilization of dynamically allo- 
■■■"; cated resources could be realized if protection mechanisms 
' enforcing separation of software modules required above for 
, • “experimental” versions and a resource allocator (a standard 
. ' part of operating systems) existed. 

ii'yt Use of Standardized Computer Hardware. As part of 
NASA’s standardization program, standards are being set for 
... ; onboard spacecraft. This is intended to reduce costs by 

. -.i avoiding new hardware development and space qualification 
efforts, decrease development time by ensuring the early 
! availability of the hardware, and increase reutilizations of 

•A j existing software. However, since hardware technology is 

.. changing faster than the mission launch rate, standardization 

. results in the use of obsolete hardware. This limits the 

' : j resources available to any machine intelligence system. Devel- 

opment of software portability, or equivalently hardware 
, : VJ compatability in a family of machines, would mitigate all of 

■■■■'! these problems except for the time and cost of space 

: qualification. 

Long Lead Times Required by System Integration. Cur- 
j rently all software, like all hardware, must be created, 

V‘-..j debugged, and integrated many months before mission launch 

. j to ensure proper spacecraft functioning. But unlike hardware, 

. | software can be modified after launch via telemetry. This is 

j especially important in long-life missions lasting many years. 

1 During that period, as the result of increased scientific 


knowledge, better software development techniques, and/or 
changed mission objectives, there may well be a need to 
modify and/or update the onboard software. 

The benefits would be reduced lead time for software and 
an increased flexibility in mission objectives. This capability 
could be quite critical to early utilization of maturing machine 
intelligence technology. This notion is equally applicable for 
ground-based systems which may be utilized long after the 
mission launch date. They too must be capable of being 
upgraded, modified, and/or supplanted by experimental capa- 
bilities during mission operations. The basis for such capability 
is a centralized pool of computing resources with dynamic 
allocation and a protection mechanism for system integrity. It 
Should be noted that this notion has already been incorporated 
into the Galileo mission plan (though for cost rather than 
flexibility reasons) in which the spacecraft software will be 
delivered after launch. 

Desire to Minimize Onboard Software Complexity. This is 
part of NASA’s larger effort to minimize spacecraft complex- 
ity to increase reliability. As above, special recognition of 
software’s unique characteristics must be made. Otherwise 
onboard capability will be unnecessarily restricted. The mini- 
mized complexity criteria should be applied to only the 
baseline software and the protection mechanism, for this is the 
only portion of the spacecraft software relating to reliability, 
rather than the entire package of “enhanced” modules. With 
such an approach, capabilities, including experimental machine 
intelligence systems, could be incorporated in spacecraft 
without compromising reliability. 

Central to each of these spacecraft trends is the notion that 
software is an ill understood and difficult to control phenom- 
enon. It must therefore be managed, restricted, and carefully 
isolated into separate pieces. This notion has, in the past, been 
, all too true, and its recognition has been manifest in the trends 
described above. However, current experience with time- 
sharing systems have developed techniques, combining hard- 
ware facilities and their software control, for providing 
separate virtual machines to several processess. Each virtual 
machine while protected from the others shares a dynamically 
allocated pool of resources (such as memory, time, and 
bandwidth) which may include guaranteed minimums. With 
such a capability, simulating separate machines via a hardware/ 
software mechanism, all of the reliability advantages of 
separate machines are retained while the flexibility of dynamic 
resource allocation are also achieved. With proper software 
design these capabilities could be built into a general facility 
for incremental replacement of baseline modules by enhanced, 
and possibly experimental, versions with automatic reversion 
to the baseline module upon failure of the enhanced module. 
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5.3 Computer Systems Development 
Recommendations 

We recommend a virtual machine and software-first ap- 
proach to system development: 

1. For both ground and spacecraft software, that NASA 
develop a virtual machine approach to software develop- 
ment in which protection between processes is main- 
tained by the operating system which also allocates 
resources as required with certain guaranteed minimums. 

2. That within such a facility provisions be made for 
supplanting modules with upgraded or “experimental” 
versions. The operation of such modules will be moni- 
tored automatically and/or manually and upon failure 
will be automatically replaced by the reliable baseline 
module. 

3. That NASA adopt a “software first” approach so that 
hardware can be supplied as late as possible (to take 
advantage of the latest capabilities). To support such an 
approach, either software portability (for newly devel- 
oped code) or compatible machine families must be 
provided. 

6. Software Technology 

This section makes recommendations concerning the use of 
machine intelligence to further the production and mainte- 
nance of software throughout NASA. In addition, we will 
strongly recommend increased utilization of (non-machine 
intelligence) computer science to improve NASA’s current 
capabilities in software. 

6.1 Introduction 

NASA is basically an information Organization. Its mission 
is to collect, organize, and reduce data from near- and 
deep-space sensors into usable scientific information. Comput- 
ers are obviously essential to this mission as well as to the 
launch and control of the spacecraft involved. Annual com- 
puter expenses, for both hardware and software, represent 
about 25% (?) of NASA’s total budget. Compared with other 
users of computer technology, such as military and commer- 
cial organizations, NASA appears to be merely a state of the 
art user. But compared with the programming environments 
found in universities and research institutes from which 
this Study Group personnel panel was drawn, there is a world 
of difference. The technology lag represented by this gap is 
not NASA’s responsibility alone, but is indicative that an 
effective technology transfer mechanism does not yet exist 


within the computer field. NASA would do well for itself, and 
set a fine example, to remedy this. 

There are two main issues we wish to cover in this section. 
The first concerns characterizing the state of software develop- 
ment within NASA, comparing it to the advanced software - 
development facilities available in selected universities and 
research institutes, and outlining a short-term plan to effec- 
tively transfer this technology into the NASA environment. 
Secondly, there exists some preliminary, but far from practical 
machine intelligence work on automating various parts of the 
software development process. We will briefly examine this 
work and its potential for NASA; then suggest an appropriate 
role for NASA in this field. 

6.2 State of the Art 

Software Development within NASA. With rare exception, 
NASA software is developed in a batch environment. Often 
the medium is punched cards. Programs are keypunched and 
submitted. Results are obtained from line-printer listings hours 
or even days later. The only debugging information provided is 
what the programmer explicitly created via extra statements 
within the program. Deducing the cause of a failure from the 
debug evidence produced is a purely manual operation. 
Changes are made by keypunching new cards and manually 
merging the corrections with the program. Then the cycle is 
repeated. In some NASA environments, cards have been 
replaced by card images stored on a file and corrections are' 
made with an online editor, but the process is essentially the 
same. Only the keypunching and manual manipulation of 
cards has been supplanted. The programs are still developed 
and debugged in a batch mode. 

Software Development in Machine Intelligence Labora- 
tories. In striking contrast to the NASA program development 
environment is that existing at several laboratories (such as 
CMU, MIT, Stanford, BBN, ISI, SRI, and Xerox) working on 
machine intelligence. This environment is characterized by 
being totally online and interactive. The heart of this 
environment is a fully compatible interpreter and compiler and 
an editor specifically designed for the language and this 
interactive environment. The remarkable thing is that this 
environment is based neither on machine intelligence mecha- 
nisms nor concepts, but rather on a machine intelligence 
philosophical commitment to flexibility and a few key 
computer science ideas (that programs can be manipulated as 
normal data structures and that all the mechanisms of the 
language and system must be accessible so that they too can be 
manipulated). 

These key ideas and a long development by many talented 
people, have created an unrivaled software development 
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environment. In it, changes to programs are automatically 
marked on reformatted listings, the author and date of the 
changes are recorded, the correspondence between source and 
object modules is maintained, automatic instrumentation is 
available, non-existent code is easily simulated for system 
mock-up, extensive debugging and tracing facilities exist 
including interactively changing the program’s data and restart- 
ing it from any active module. In addition, arbitrary code can 
be added to the interface between any two modules to 
monitor their actions, check for exceptional conditions, or 
quickly alter system behavior. Also an analysis capability 
exists to determine, via natural language, which modules call a 
given one, use a variable, or set its value. 

There are many other capabilities far too numerous to 
mention but the key issues are that they are fully integrated 
into an interactive environment and that a commitment has 
been made to substitute computer processing for many of the 
programmers’ manual activities. As computer power becomes 
less and less expensive (By 1985, according to a CCIP-85 
study, hardware will represent only 15% of the cost of a 
computer system. The rest will be cost of people producing 
the software.) while people get more expensive, such a policy 
must clearly predominate. Furthermore, several studies have 
shown that software quality and cost improve as the number 
of people involved decreases. Thus, environments which 
improve programmer productively by automating certain 
functions also improve quality while reducing costs. 

6.3 Software Development 
Recommendations 

For these reasons, we recommend: 

1. That NASA immediately undertake a program to recre- 
ate within NASA the interactive programming environ- 
ment found in various machine intelligence laboratories 
for some NASA language. 

2. That NASA consider creating a modern data- 
encapsulation language (of the DODI variety) as the 
basis for this interactive facility. 

3. That NASA only undertake this project with close . 
cooperation of an advisory group drawn from these 
laboratories and with NASA personnel familiarized with 
these interactive environments via extended onsite 
training visits (approximately 6 months duration) 

4. That NASA acquire the necessary computer hardware to 
support such an environment. 


Automatic Programming. Having dealt with the current 
state of NASA’s software production and its improvement 
through utilization of existing computer science technology, 
the central issue of utilizing machine intelligence for software 
production can now be addressed. 

Software is essential to NASA’s mission. It is used to launch 
and control spacecraft, to collect, reduce, analyze, and 
disseminate data, and to simulate, plan, and direct mission 
operations. The other sections of this report address extension 
of these capabilities through incorporation of machine intelli- 
gence in these software systems. Here, holding the function- 
ality of the software constant, the use of machine intelligence 
to produce, or help produce, the software is considered. 

Even with the capabilities suggested above for improving 
the production and utilization of software, the development of 
software is still largely a manual process. Various tools have 
been created to analyze, test, and debug existing programs, but 
almost no tools exist which aid the design and implementation 
processes. The only available capabilities are computer lan- 
guages which attempt to simplify the statement of a finished 
design or implementation. The formulation of these finished 
products is addressed only by a set of management guidelines. 
As one can imagine, these manual processes with only minimal 
guidelines, unevenly followed, are largely responsible for the 
variability currently found in the quality, efficiency, cost, and 
development time of software. 

It is quite clear that significant improvements will not occur 
as the result of yet “better” design and implementation 
languages or “better” guidelines, but only by introducing 
computer tools which break these processes down into smaller 
steps, each of which is worked on separately and whose 
consistency with each other is ensured by the computer tool. 

This approach defines the field of automatic programming. 
It is based on machine intelligence technology and, like other 
machine intelligence systems, it is domain specific. Here the 
domain is the knowledge of programming: how programs fit 
together, what constraints they must satisfy, how they are 
optimized, how they are described, etc. Programming knowl- 
edge is embedded within a computer tool which utilizes the 
knowledge to automate some of the steps which would 
otherwise have to be manually performed. There is consider- 
able diversity of opinion over the division between manual and 
automated tasks. 

The critical issues, however, are that the unmanaged manual 
processes of design and implementation which currently exist 
only in people’s heads and, hence are unavailable and 
unexaminable, have been replaced by a series of smaller 
explicit steps, each of which is recorded, and that some 


portion ot them have been automated. Over time, more and 
more of these steps will be automated and the programmer’s 
role will become more supervisory. For the first time, the 
programming process will have been rationalized and recorded, 
open for examination and analysis. This will enable programs 
to be produced which are guaranteed to be consistent with 
their specification. It will eliminate the need for program 
testing and the cost and unreliability associated with undis- 
covered bugs. In addition, as automation increases, costs and 
effort will plummet. Besides the obvious advantages these 
reductions offer, a very important side benefit will occur. We 
know from instrumentation studies that large systems are not 
efficient when first implemented. Unanticipated bottlenecks 
always occur. The drastically lower costs of implementation 
will afford the opportunity for people to experiment with 
alternative implementations. These experiments wil broaden 
their experience base and enable them to develop better 
intuitions about how such implementation should be con- 
structed. Furthermore, once people have gained this knowl- 
edge, it can be incorporated as a further automation of the 
programming process. 

All of this paints a very rosy picture about automatic 
programming. The catch, of course, is that these capabilities 
don’t yet exist. The field is in a quite formulative stage. 
Impressive work is being done in a number of research labs, 
but none of these systems is close to practical use by an 
external user community. A period of research support 
followed by specialization to particular applications is needed 
if NASA is to reap any of these potential benefits. Since each 
of NASA’s missions require similar, but different, software, a 
number of such specialized automatic programming systems 
could be constructed to cover a large percentage of NASA’s 
total software effort. Recommendations: 

1 . That NASA develop a research and development plan, in 
conjunction with experts in automatic programming, for 
the creation of automated tools for the design and 
implementation Stages of the software development 
process. 

2. That NASA identify its major areas of software concen- 
tration and that specialized AP systems be developed for 
these as the field matures. 

7. Data Management Systems 
Technology 

This section briefly outlines a proposal for a coherent data 
management system which would control data acquisition, 
reduction, analysis, and dissemination. We discuss NASA’s 
NEEDS effort highlighting those areas where machine intelli- 


gence techniques may be brought to bear. We propose a 
greater emphasis on intelligent sensors to perform data 
reduction and selective data transmission, and the develop- 
ment of knowledge data bases to aid in experimentation and 
planning. * 7 

7.1 Introduction 

Current and future planned missions within NASA are 
oriented heavily towards the acquisition, dissemination, and 
analysis of data transmitted from space. The amount of such 
data is currently voluminous and will become larger by an 
order of magnitude in the 1980s. An estimate of the problem 
in the 1980s indicates that some IQ 10 bits of data/day will be 
generated for non-imaging data, while some 10 1 2 bits/day will 
be generated for imaging data. The magnitude of the data 
acquisition and dissemination problem is staggering. When one 
adds the increased sophistication in data processing needed to 
convert raw data to information and to make it accessible to 
the users one has a major problem in managing such data. 

The present NASA data management system has evolved-in 
an ad hoc manner. Continuation of an ad hoc approach will 
neither be resource-effective nor meet the needs of the 
scientific user community for the post-1980 time frame. 
Greater reliance must be placed upon computers playing a 
greater role in space. The heavy density of data, instrument 
sophistication, and miniaturized microprocessors in space man-~ 
date that resource effectiveness be achieved on and between 
missions by end-to-end management of data. This will involve 
policy, management, software, and- hardware. It is extremely 
important to have careful planning or central management 
planning for data. To achieve resource-effectiveness, the 
management of data must become a controlling force in the 
development and plans for any mission. In the following 
sections, we shall briefly describe the flow of data as it exists 
now, and the end-to-end data management concept that will 
be necessary to meet the demands of the 1980 era and 
beyond. We shall also discuss the steps required by NASA to 
meet the major challenge of the data management problem. 

7.2 State of the Art: Flow of Data Within 
Missions 

The flow of data from an instrument to a principal 
investigator in today’s technology goes from the instrument 
onboard to data processing on the ground and then is 
transmitted to a principal investigator or to facility instrument 
team members. 

Future missions will require that, instead of a one- 
instrument to one- or many-instrument users, it will be 
necessary to have the outputs from many instruments onboard 
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the spacecraft undergo data processing and provide outputs for 
many users. For example, weather and climate information, 
spacecraft thematic data, and hydrological data obtained from 
many instruments are combined with data obtained through 
non-space observations to prepare food and fiber production 
forecasts. 

Current Data Control. The management of data as it is 
obtained from the spacecraft is currently provided by onboard 
control management. They specify the data to be sensed, 
conditioned, handled, and transmitted by the instruments on 
the spacecraft. They have available to them flight direction 
data, and can make adjustments during the flight. Investigators 
who desire changes, must negotiate with the management 
team. Data obtained from a mission must undergo processing, 
sorting, and distribution. Further reduction, extraction, and 
analysis of the data takes place to transform the data into 
useful information. The transformed data and the raw data are 
stored in central repositories and distributed to principal 
investigators for further processing, analysis, and use. 

This flow of data is illustrated by the LANDSAT project. 
LANDSAT data is currently transmitted to line-of-sight 
ground stations located at Beltsville, Sioux Falls, and Gold- 
stone in the United States and in three foreign countries. The 
data is now in the planning stages to be transmitted to several 
other foreign ground stations. It is then retransmitted over the 
Space Tracking Data Network (STDN) or mailed to the 
Goddard Space Flight Center. In either case a three or four 
day delay results in the transmission receipt at Goddard. 

The raw data is assembled at Goddard where it must be 
spooled-up waiting for other data related to the flight, such as 
orbital information and fine attitude of the spacecraft. Some 
processing is performed on the data to account for such 
factors as the curvature of the Earth. Goddard then cuts a tape 
and transmits the processed dati to the EROS Data Center run 
by the Department of the Interior in Sioux Falls. EROS 
catalogs the data, stores it in its data base and distributes data 
to users on a payment basis. Upon request, EROS produces 
high quality enhanced data. However, no LANDSAT data 
conforms to any particular map. 

The LANDSAT data system for the U. S. should experience 
considerable improvement when a Master Data Processor 
(MDP) becomes available at Goddard. Such a MDP will pro- 
vide space oblique mercator projections analogous to those 
obtained from ortho-photo aircraft images. Furthermore 
it is able to use selected ground control points for each 
frame to permit sequential frame overlays from several 
passes over a particular area. The master data processor can 
solve the problem of making the digital images look right, 
and can provide temporal registration. However, the MDP 


is limited in that the images are not keyed to a specific map 
projection. 

Future Control: NASA End-to-End Data System (NEEDS). 
Projected mission data requirements exceed the present system 
capabilities to handle them. The increase in data volume can 
only partially be met through engineering technology improve- 
ments, as there promises to be a concomitant increase both in 
the number of users and complexity of sensor-related tasks. 
New demands continually arise for more complex instruments, 
better interfaces between instruments, and more sophisticated 
data processing. Many experiments and applications tasks in 
the near future will require a direct user/sensor coupling on a 
non-interference basis. This should require the development of 
dedicated, distributed microprocessors on board the space- 
craft. Other applications in space will require large centralized 
processing on the ground to effectively integrate information 
provided by several satellites. For both instances, data manage- 
ment adminstration prior to launch of each mission is needed 
to assure coordinated information acquisition and integration. 

An end-to-end data system will consist of the following 
elements: 

1. Instruments Aboard Spacecraft - which sense data, 
provide attitude and position information, Greenwich 
mean time, and provide control information on the 
downlink to ground. They are provided uplink or 
control feedback to specify information to the sensors as 
to where to look, when to look, and how to look. Such 
control may emanate from mission operations or 
directly from users who can access the sensor remotely 
from terminals. 

2. Operations - monitors system performance, develops 
housekeeping information, coordinates activities, main- 
tains a directory of the system, provides user assurance, 
and accounting information of the downlink. On the 
uplink to the spacecraft operations allocates resources, 
modifies the real time configuration, specifies flight 
maneuvers, coordinates user needs to the spacecraft, and 
provides housekeeping and repair for the spacecraft. A 
data base is maintained and updated on space flight 
information. 

3. Data Staging - receives data from operations on the 
downlink through commercial networks and packages 
the data by operating on the output of many instruments 
required by a user. The output of instruments may 
require calibration, and the packaged data must be 
distributed to multiple users. Such information must be 
amassed in short periods of time (near real-time) to 
permit the user the ability to control and change 
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instrument settings and programs. Data staging is a 
downlink operation. A data base is maintained at the 
data staging area. 

4. Distributed Users - provides near real-time data and 
investigates and screens output from instruments on the 
downlink. The data may be received directly from 
operations via commercial networks or be transmitted 
via commercial lines from the data staging area. On the 
uplink the users provide planning, scheduling, and 
control information directly to sensors that they control 
and which are independent of other instruments on- 
board the spacecraft. The user maintains a specialized 
data base. 

5. Knowledge Centers — maintains data and knowledge on 
specialized topics. The data from a mission is trans- 
mitted via commercial networks to one or more knowl- 
edge centers. The knowledge centers provide services to 
the distributed user community. They maintain not only 
mission supplied data, but data from other sources. A 
knowledge center concerning weather data would main- 
tain temperature, barometric pressure, and other 
weather data obtained by ground observation and 
measurement. The knowledge centers maintain archival 
records as well as data records. Knowledge centers will 
be linked together through commercial networks so that 
they may access one another. Users may access data in 
the knowledge centers through their remote terminals, 
and may thus perform operations on the data either at 
their own facility, or through the knowledge center 
facilities. 

Data Onboard the Spacecraft. Decisions must be made 
concerning the management of data within the spacecraft 
itself. These decisions will be a function of the particular 
mission, and whether or not there is ready access or 
interaction required with the user. For example, on a robotics 
application on Mars, because of the distance involved and the 
attendant time lag, it will not be possible to direct the robot 
from the ground, except to provide it with general goals to be 
achieved. This will require that the robot contain a large 
amount of data and information to permit it to maneuver on 
Mars. It will have to have the following, as a minimum: 

1. Information as to the location, size, and composition of 
objects. 

2. A model of the terrain. 

3. General rules about the relationships between objects. 

Item 1 can be supplied partially from ground analysis of 


images and by measurements taken by the robot itself. Item 2 
can also be obtained partly on the ground and partly by the 
robot. General rules and axioms, on the other hand, needed to 
devise and effect plans, must be provided by the ground. 
Regardless of where the data and information arises, it is 
essential that capabilities exist within the robot to store, 7 
retrieve, and manipulate large amounts of data. Hence, a data 
management system will be necessary as a part of the robot, 
itself. A non-conventional data management system will be 
required— one that can do deductive searching and plan 
formation. Hence, it will require an extensive knowledge base 
system, a semantic network, and an inference mechanism. 

Even if the spacecraft is to be used to transmit data to 
Earth, where a data management system exists, there is 
considerable planning that can be accomplished so as to 
improve the efficiency of data acquisition. For example, the 
following topics should be addressed on every mission: 

- Need for data compression to minimize the number of 
bits transferred and to conserve communication 
channels. 

— Data needed by the investigator as obtained by his 
instrument and other instruments. For example, if an 
image is to be transmitted to Earth, related data needed 
by the user should be transmitted at the same time. If, 
for example, the spacecraft is orbiting Earth, the precise 
location of the space vehicle, the angle of the Sun, and - 
the angle of the camera tilt would be required as a 
minimum. Incorporating such data will save considerable 
effort on the ground by a small amount of processing in 
space. 

— Use of data by the scientist. Knowing the use to which 
the scientist will make of the data can determine the 
needs for archival data, qnd whether or not processing of 
the data either on the spacecraft, at mission control, or 
at a central repository would be of help to the scientist. 

Data management planning for the spacecraft is, therefore, one 
important element of a data management plan for a mission. 

Operations. Operations plays a central role on each missionr 
Hence, the data management system requires careful attention 
here. Design of the command system to the spacecraft must 
include consideration of integrity constraints. Such constraints 
assure that commands given to the spacecraft are legal ones, 
and that inevitable ground errors are minimized. Users who 
have sensor control on the spacecraft should have their 
commands passed via commercial digital networks to the 
operations group where the user command may be overridden 
if deemed necessary by operations, and if not overridden, the 
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command is scheduled for transmittal to the spacecraft, 
logged, and the user notified automatically as to when the 
command is to be transmitted. The delay between user 
transmittal and override should be in the order of a few 
minutes at most as user/sensor control should take place 
automatically only when the sensor is independent of other 
sensors onboard the spacecraft. 

Operations will require a sophisticated message switching 
system to determine where space data is to be transmitted. It 
must also have a data base management system to be able to 
store and retrieve data retrieved from a mission. The amount 
of data for storage and retrieval at the operations facility will 
depend on the mission. Data for a few days receipt can be 
maintained while data of more than a few days can be retained 
at knowledge centers and retrieved through the network as 
needed. 

Data Staging. Data staging provides many-instruments to 
many-user capabilities. Data from many instruments are 
transmitted via commercial lines to the data staging area. The 
data undergoes data processing to transform it into usable 
modules for many users remotely connected to the data staging 
area. Capabilities should be provided to allow user access to 
raw and processed data. The users should be able to specify to 
the data staging area the operations to be performed on the 
data. The results can be placed into operational data sets 
consisting of processed data. All users should have access to all 
operational data sets. It will not be unusual for the many users 
to want common data. Making available the operational data 
sets to all users could save duplication of processing. 

The management of the data staging facility should assure 
that the same function is not applied many times and should 
recognize common requests by multi-users for the same 
processing functions. The data staging area should transmit 
processed data not only to users on an automatic basis, but to 
the knowledge centers for archival purposes. Data staging may 
be viewed as a switching and processing center. Its primary 
function is to operate upon spacecraft data and transmit the 
results to the user. It will have to maintain data base directory 
services, as required by the user. 

Users. The purpose of a mission will be to supply 
instrument information to the user population. The data 
staging area and operations can supply processed data and raw 
data to the user. Neither operations nor data staging can be 
expected to perform all the processing required by the user. 
Users will require their own processors and a means for storing 
and retrieving large quantities of data. One would not 
anticipate that users would require their own archival system 
as such a function can be provided by the knowledge centers. 


The range of needs for the user cannot be anticipated in 
advance with respect to data processing functions. However, 
some users will require conventional data base management 
systems. In the former, there should be a major effort to 
standardize the data base management systems so that each 
user does not build or buy his own system. User proposals 
should be scrutinized carefully by a data base management 
system organization. Knowledge base systems will become 
prevalent in the 1980s. One can incorporate a knowledge base 
capability with a conventional data base management system 
(primarily relational data base systems), or build a special 
purpose system using one of the artificial intelligence lan- 
guages. Such systems will be needed to do image analysis and 
picture processing, and to extract new data relations from 
given relations. Knowledge base systems can be general, but 
will require specific details based on the particular application. 
For instance, knowledge base systems could contain many of 
the features required for robots to accomplish their jobs on 
remote planets. 

Knowledge Base Centers. Knowledge base centers will 
require three different types of data base management 
systems: 

1. Archival. 

2. Generalized Data Base Management System. 

3. Knowledge Base System. 

Knowledge base centers should contain archived data relating 
to missions and related data. They must be able to retrieve 
requests and transmit responses to users who can access the 
knowledge centers through remote terminals. Requests can be 
specific — such as to retrieve the tape for the fifth orbit of 
data transmitted on a certain mission. They can be general, 
such as to send all tapes where the infrared reading was 
between certain limits on a mission. Thus, knowledge centers 
will have to maintain indexes of the data, and must perform 
some classification of data as it is stored in the system. 

Knowledge centers should also contain conventional data 
base management systems used to store and retrieve data 
conveniently contained in records. Whereas user data base 
management systems should fit on minicomputers, large scale 
computers and sophisticated data base management systems 
will be required. A distributed network of data base manage- 
ment systems should be investigated to iterate the various 
knowledge centers. 

Sophisticated knowledge base systems which contain 
specific facts, general rules, and world models (e.g., a map 
containing roads that will be used as a template to match 
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against images and be used to detect roads in images) will be 
required. By a general rule is meant a statement of the type, 
“If object 1 is to the left of object 2 and object 2 is to the 
left of object 3, then object 1 is to the left of object 3.” 
Complex knowledge base systems may also have to interface 
with sophisticated mathematical models. The area of knowl- 
edge base data systems is important and will require additional 
research support. 

7.3 Opportunities and Recommendations 

The NEEDS effort provides the potential for improving 
NASA use of data. Part of the improvement can come about by 
developing intelligent sensors and digital computer systems on- 
board spacecraft. Another part of the improvement can come 
about by developing an efficient ground communication/data 
processing system. 

Intelligent sensors and digital computer systems onboard 
spacecraft can: 

— Send back processed, rather than raw data. 

— Obviate the need for documenting data on the ground as 
attitude, Greenwich mean time, and other information 
can be sent back to Earth with the sensor data as it is 
collected. 

— Decrease the data flow as only relevant data need be 
returned to Earth (for example, if image data of Earth is 
to be sent and the scene is obstructed by cloud cover, 
the space computer should detect this occurrence, and 
not send the useless cloudy image). 

— Respond to changes as to what should be collected as 
received in commands from the users. 

— Compress data so that needless or redundant informa- 
tion does not overload the communications channel. 

— Allow direct user-remote control. 

An efficient ground communication/data processing system 
can: 

— Permit near real time processing of data. 

— Provide enhanced user services. 

— Transmit processed multisensor data to users. 

— Retrieve archival data more readily. 


It is estimated that substantially less than 10% of all data 
received from space is ever used. By decreasing the amount of 
useless data by introducing intelligent sensors, and by pro- 
viding better data management facilities to store, retrieve, and 
manage real-time and archival data, substantially greater use of_ 
data may be anticipated. . 

Although the NEEDS effort could yield considerable 
benefits for NASA, the efforts being conducted do not appear 
to be promising. NASA is taking a bottom-up approach to 
NEEDS. That is, rather than developing a comprehensive 
systems engineering approach to achieving such a system, a 
piecemeal approach is being taken. Various technologies are 
being investigated in an attempt to develop NEEDS. Although 
new technologies are clearly necessary, there is scant investiga- 
tion into how they are to be brought to bear in a final system. 
To achieve an end-to-end data system that will provide users 
greater control over sensors, and will enhance the acquisition 
and dissemination of information from space, requires a 
systems approach in addition to a technology approach. The 
work will require significant planning at the management level, 
sophisticated software developments, and matching hardware 
capabilities. At the present time there appears to be no 
appreciation of the complexity of the NEEDS effort and the 
importance of engineering an entire system. Not only are 
intelligent sensors and reliable microprocessors needed in space 
but the management and flow of data from the spacecraft to 
the users and archival stores is essential. The following are- 
some specific recommendations. 

Management Recommendations 

Data Management Plan Coordination Group. A centralized 
group within NASA consisting of computer scientists will be 
necessary to provide overall plans for managing mission data. 
The group is needed to assure coordination between missions, 
to minimize duplication, and to determine the general tools 
that should be provided to user scientists by NASA. They 
should be concerned with assuring that there is a cost- 
effective, appropriate data management plan on all missions. 
They should further be concerned with the acquisition and 
development of equipment and software. 

Mission Data Management Plan and Coordination Group. 
The mission-oriented group should provide plans as to how 
end-to-end data management will be achieved on a mission. 
They should be concerned with how to integrate the mission 
objectives with current and planned data management systems 
within NASA. The group should also consist of computer 
scientists and should review all data management plans with 
the centralized group. 
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Technical Recommendations 

NASA End-to-End Data Management System. The NASA 
end-to-end management system outlined in this report must 
undergo considerable planning and detail before it can become 
a reality. A systems engineering group consisting of computer 
scientists and hardware experts is needed now to achieve an 
effective system concept and implementation. 

Knowledge Centers. The concept of knowledge centers 
must be explored carefully from a technical level to determine 
how they are to be achieved. Careful consideration will be 
required to develop or obtain appropriate archival and data 
base management systems. Insufficient attention is being 
placed on this aspect of NEEDS. 

Knowledge Base Systems. Research support is required for 
work in this area. Emphasis should be placed on enhancing 
relational data base systems so that they can be used in 
conjunction with problem solving systems needed to achieve 
knowledge base system capabilities. A knowledge base system 
can be achieved, and is necessary for NEEDS. Again, 
insufficient attention is being placed on such systems. 

8. Man-Machine Systems 
Technology 

This section deals with the three major components of any 
advanced man -machine system: modeling human control 
processes, the design of interfaces between human and 
intelligent computer systems, and the design of the 
manipulators themselves. 

8.1 Introduction 

The major deficiency in the application of machine 
intelligence and robotics to the special problems of NASA is a 
lack of knowledge about how to design effective man-machine 
systems. This deficiency is fundamental and is based on a lack 
of knowledge of human processes of machine control and of 
the interface. For example, teleoperators are understood 
neither at the level of the human control processes nor at the 
level of determining an appropriate design of manipulation and 
the proper design of the information interface between human 
and teleoperators. 

There does exist considerable knowledge about each of the 
fields that contribute to their problems. Cognitive psychology 
has developed considerable expertise and knowledge about the 
structures of human information processing, most especially 
those of perception, language, and memory. Workers in 
control theory have developed sophisticated procedures. 


Human-computer interaction has been widely studied. The 
weaknesses lie in the lack of useful predictive models of the 
interaction of these components. 

Despite the fact that human-machine interaction is critical 
to the success of almost all of NASA’s missions, NASA’s 
present organizational structure does not appear to accommo- 
date research on man-machine systems or man-computer 
interaction required. NASA Life Science programs (under 
Office of Space Science) have concentrated on medical, 
physiological, botanical, bacteriological, and biochemical 
disciplines. OAST has sponsored some man-machine research, 
but primarily as related to aeronautics. The NASA centers 
have done more man-machine research on an ad hoc basis, as 
required by the project offices. Thus, human information 
processing, man-computer cooperation, and man-machine 
control basic research has tended to “fall between the cracks.” 

The result is that in current and planned missions there is 
confusion about what mechanisms are most appropriate for 
communication in control and feedback of information to 
human operators, and what constitute appropriate tasks for 
humans and for machines. So far, the ambiguous status of the 
human-machine systems research has not led to any grave 
difficulties, primarily because the conservative engineering 
philosophy of NASA helps avoid major difficulties. The lack 
does severely limit applications, however. 

Fundamental improvements in human-machine interaction 
will come about only if NASA leads the way, supporting basic 
research directed at the fundamental problems, developing 
applied laboratories, developing new conceptualizations and 
new techniques. This work must be mission independent, 
developed from broadly based fundamental research and 
development programs that are not subject to the complex 
limitations posed by mission-oriented studies. There must be 
better means for life science and technology organizational 
components to interact in bringing a more rigorous focus on 
these crucial long-range research problems. 

8.2 Human Information Processing 

Human information processing is the study of the psycho- 
logical mechanisms underlying mental functioning. Memory, 
problem solving, language, perception, thinking — these are 
some of the major areas studied. In the past decade there have 
been sufficient systematic advances in our knowledge that 
these areas now constitute perhaps the best understood 
problems in contemporary psychology. Studies of attention 
are of special importance to problems faced by NASA. 
Humans have limited mental resources, and the deployment of 
these resources constitutes an important part of behavior. The 
limitation appears to apply primarily to conscious control. 
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Tasks that require conscious decision-making or control can 
suffer in times of stress or when other tasks must be 
performed or thought about simultaneously. When several 
tasks simultaneously demand a share of conscious resources, 
deterioration of performance results. Tasks that are learned 
well enough that they appear “automated” seem to suffer 
little as a result of other activity. 

Despite the relative amount of knowledge about human 
processing mechanisms and control structures, we know 
surprisingly little about aspects that are relevant to the 
problems faced by NASA. We do not know enough about the 
nature of conscious and subconscious-control mechanisms. We 
do not know enough about the various modes of operation of 
the human. We know very little about the human’s ability to 
interact with and control the environment. Almost all our 
knowledge deals with the processing of arriving information, 
or the operation of the human as an element of a control 
structure. This leaves unanswered much of importance. The 
human has two cortical hemispheres, each one appearing to be 
specialized for different types of processing. One hemisphere 
appears to be serial, the other more parallel or distributed. We 
have just begun to explore the implications of these differ- 
ences; exactly how they apply to control issues is not 
understood, although there are obvious implications. 

This area of knowledge about the human has many 
potential applications for NASA. On the spacecraft, at mission 
control, onsite during a mission, all these situations require 
different aspects of human capability. We find no evidence 
that NASA is engaged in systematic study of these issues. Yet 
they are critical to the success of NASA’s missions, the more 
so as missions become longer, more complex, with space 
repair, manufacture, and mining as possible tasks. 

8.3 Human-Machine Control Processes 

Whether humans are physically present in space or not, 
their intelligent control and supervision are fundamental 
requirements for any space mission within this century. Their 
sensory, cognitive, and motor-control systems can play useful 
roles in combination with machine intelligence. The combina- 
tion of human and computer has potential for more capability 
and reliability than either by itself. But realization of this 
synthesis requires much progress at the level of basic research 
and application. 

A primitive discipline and art of human-machine systems 
does now exist, although it is unevenly developed. There are a 
small number of texts and several scientific journals. There are 
regular workshops and annual meetings. In such areas as the 


performance of the pilot and air traffic controller in aviation, 
the work has been sufficiently successful that aircraft manu- 
facturers and government regulators determine their hardware 
and procedures to a significant degree from research findings.' 
But the most sophisticated of this research and the most * 
successful empirical applications have been devoted to situa- 
tion, where the human is in continuous control of the system. 
As the computer becomes more capable of intelligent opera- 
tion, the human operator becomes more like a “manager” or a 
“supervisor” than an active in-the-loop controller. This new 
“supervisory control” mode of operation is not well under- 
stood. Ames and Langley both have in-house and university 
research programs to study these new roles in the context of 
aviation. This is a start, but more needs to be done, especially 
directed towards the particular problems faced by space 
programs. 

8.4 Human interactions For Ground-Based 
Missions 

A large fraction of NASA’s budget is spent on an extensive 
system for monitoring, controlling, and processing data fronTa 
large number of Earth-orbital and deep-space vehicles. The 
Study Group severely questions much of this work. There has 
not been sufficient research done on the proper balance 
between human and machine judgment and analysis, not 
proper studies of the appropriate balance between decisions 
made in space and on the ground at mission control. While the 
new color-graphic computer based displays offer much 
possibility, their appropriate display formats are not well 
understood. The role of intelligent programs is significantly 
underestimated. 

8.5 Teleoperators 

Ever since the development of remote manipulators for 
nuclear applications in the early 1950s, it has been clear that 
teleoperators can be used to extend the human’s vision, 
mobility, and manipulation capability into space, undersea, 
and difficult environments. Nonetheless, compared to the 
magnitude of the potential saving over sending man into space 
to sense and manipulate, there has been little developmental 
work on the control factors of teleoperators. (In similar 
fashion, there has been surprisingly little development of the 
manipulators themselves, but this is covered in a different part 
of this report.) When continuous remote control is not 
possible because of signal transmission time delays, restrictive 
“move and wait” strategies are required for remote operation. 
This can lend to awkwardness and instability,especially when 
force or touch feedback is used. It is clear that to perform 
large space construction tasks or planetary mining, etc., in 
other than near-Earth orbit, this type of control is intolerable. 
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The Study Group finds that there has been surprisingly 
little advancement in manipulator development since the 
1960s, though recently some significant theoretical contribu- 
tions to kinematics and control of manipulators are evident in 
both U.S. and Soviet literature. Current manipulators do not 
have the reach, precision, or sense ability required for space 
assembly. End effectors are awkward and tool changing is 
slow. Even use of the human operator in a supervisory mode 
(with a computer doing much of the control), requires close 
contact with the task via television and force-reflecting sensing 
mechanisms. Manipulation dexterity must be understood 
better at a fundamental level. 

Thus, assuming semiautomatic assembly, where the human 
plays a supervisory or decision-making role, two things are 
needed: development of intelligent computer control systems, 
and the understanding of the role of the human in this mode 
of operation. There has been insufficient research to under- 
stand the proper interface that should exist when the human 
plays this higher-order role in the feedback system. 

8.6 Spin-Offs 

NASA sponsored research dealing with capabilities of the 
human-machine interaction are bound to lead to important 
spin-offs. Increased understanding of sensory and control 
mechanisms will be important for the development of sensory 
prostheses, for development of new systems to aid in the 
control and management of complex tasks, and perhaps 
systems capable of expanding our cognitive abilities by 
cooperative interaction with machines. 

Research on man-machine questions should have direct 
application to various non— NASA needs such as unmanned 
mining, deep ocean exploration and construction, disposal of 
nuclear waste, and intracorporeal surgery performed remotely 
using optical fiber bundle devices. New techniques of indus- 
trial automation are a possible outgrowth of such research. 


8.7 Recommendations 

NASA must reassess the role of the human in the control 
of sophisticated systems. Low level, detailed control will 
probably best be done by intelligent computers, giving the 
humans higher level, more complex decision making and 
administrative responsibilities. 

NASA should develop a strong research program on human 
information processing, on man-machine control processes. 


and on the interface issue. There is special need for study of 
situations with high data rates, with a need for rapid decisions, 
and with stress. There should be direct interrelationships 
between NASA’s mission needs and the research programs. 
Formal ties with the university and industrial research 
communities would be useful, with collaborative research 
being potentially of great value. A scientific visitors’ program 
could educate NASA scientists to existing research; young 
university personnel would become educated about NASA s 
particular problems. 

Note that many of the problems faced by NASA occur in 
rich, complex environments. University research laboratories 
are unlikely to have the facilities to simulate these conditions. 
Accordingly, if NASA laboratories were made available to 
researchers from university settings, with sufficient time and 
resources, it might be possible simultaneously to increase the 
level of basic understanding of problems dealing with man- 
machine interface, and also to get direct results relevant to 
NASA. Thus, experiments on simultaneous attention, or on 
performance under stress, or on the control of teleoperators, 
using NASA simulators can be expected to make it possible for 
new kinds of phenomena to be studied. NASA has the 
facilities, but has not used them for general development. 
Universities have the technical expertise, but lack the facilities 
to do research relevant to the real needs of NASA. 

In addition, the Study Group recommends: 

1. That research in the areas of man-computer cooperation 
and man-machine communication and control be accel- 
erated, in view of the long-range critical need for basic 
understanding of these problems. This is in lieu of 
supporting such research primarily on an ad hoc and 
mission-oriented basis. 

2. That NASA organizational entities representing life 
sciences and the technological disciplines of computers 
and control develop better cooperative mechanisms and 
more coherent research programs in man-machine inter- 
action to avoid the “falling between the cracks” 
problem. 

3. That future NASA missions realize the full potential of 
teleoperators by developing improved means for “super- 
visory control” of robotic teleoperators. Thus the 
human operator on Earth can benefit from what the 
teleoperator senses, and can intermittently reprogram its 
computer as necessary. In this way the advantages of 
human intelligence in space may be had without the 
costs and risks of bodily presence there. 
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9. Digital Communication 
Technology 

There is an aspect of NASA activity that did not receive 
much attention at any of the workshop meetings. This 
involves the transfer of information among a complex, 
geographically and institutionally disparate set of groups that 
need to exchange messages, ideas, requirements, documents, to 
keep informed, plan activities, and arrive at decisions quickly. 
The clerical infrastructure to support this network of activity, 
not counting the information users and generators, managers] 
engineers, and scientists at the nodes, must account for 
approximately 15% of NASA’s budget for both inhouse and 
contractor personnel. If total personnel costs are 2/3 of the 
total budget, then the total costs for the mechanics of this 
information exchange is several hundred million dollars per 
year. A computer-based communication system can make 
significant improvements in the quality of information trans- 
fer, and probably increase the productivity of the information 
exchange infrastructure. 

The implementation of such a system would not be 
predicated on new developments in artificial intelligence. It 
would use tools that are common practice at AI nodes of the 
ARPA network and are part of the developing technology of 
digital information and word processing. Once such a develop- 
ment were carried out, it would provide the data base that 
could take advantage of sophisticated techniques of informa- 
tion retrieval, semantic search, and decision making as they 
became available. Costs can be estimated accurately from 
systems at those AI sites on the ARPA network. 


The functions to be carried out are the Directory and File 
management facilities described in the TENEX Executive 
manual and programs like SENDMESSAGE, MESSAGE, TV 
EDIT, and BULLETIN BOARD. These would operate inter-* 
actively. Programs like SPELL and PUB would be offered. - 
Teleconferencing facilities and input of documents with OCR * 
devices could be implemented. These services offer mail to 
distribution lists, creation and editing of documents, spelling 
checking, and publication formatting, with book quality print, 
arbitrary fonts and graphics, with quick turnaround. Docu- 
ments would be instantly available for online access and 
continuous updating. 

In addition to the cost savings, which would probably be 
large, there would be the following: 

On-line documentation and record-keeping system, with 
computer readable documentation, instant availability to 
updated versions, quick copies of documents, using 
hardcopy devices. 

- Document preparation services including editors, spelling 
checking, publishing and formatting programs (e.g., 
PUB); with arbitrary fonts and book quality text, with 
short turnaround time. 

— Benefiting from on-line access to catalogs of images, data - 
bases, and images; communication and sharing of 
algorithms and evaluation of algorithms would be 
enhanced. 
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Section VI 

Conclusions and Recommendations 


We believe that NASA should institute a vigorous and long- 
range program to incorporate and keep pace with state-of-the- 
art developments in computer technology, both in its space- 
borne and its ground-based computer systems; and to ensure 
that advances, tailored to NASA’s mission, continue to be 
made in machine intelligence and robotics. Such advances will 
not occur of their own accord. Many NASA requirements in 
computer architecture and subsystem design will in turn have a 
stimulating effect on the American computer and micropro- 
cessor industry, which now faces an extremely strong chal- 
lenge by foreign competition. We believe that an agency such 
as NASA, which is devoted to the sophisticated acquisition 
and analysis of data, must play a much more vigorous role in 
the design and acquisition - of data processing systems than has 
been its practice in the past. 

These findings are supported by the recommendations 
independently arrived at by the Space Science Board of the 
National Academy of Sciences 7 : 

From experience with mission operations on 
previous space missions, we anticipate that there 
will be even greater demands on data acquisition, 
processing, and storage; on mission coordination; 
and on interaction with the spacecraft and scienti- 
fic experiments. The complex nature of mission 
operations and the long time scale required to pre- 
pare, certify, and transmit routine commands in 
previous missions indicates that substantial changes 
will be necessary. We believe that significant tech- 
nical and managerial advances must be made in 
anticipation of future planetary missions, in order 
to provide reliable, more efficient, and lower cost 
systems for operation of the spacecraft and scien- 
tific instruments. 

The testing of these systems on the ground as 
operational units including the participation of 
science teams should be carried out well before the 
mission. These tests should include the operation 
with possible failure modes. These approaches will 
be more important in the future when extensive 
coordination must be obtained by use of more 
intelligent or autonomous control systems. The 


7 Strategy for Exploration of the Inner Planets: 1977-1987, Com- 
mittee on Planetary and Lunar Exploration, Space Science Board, 
Assembly of Mathematical and Physical Sciences, National Research 
Council, National Academy of Sciences, Washington, D.C., 1978. 


choice of onboard preprocessing versus earth-based 
processing and the utility of block telemetry for- 
matting and distributive data handling and control 
subsystems will require assessment. In the past, 
computing facilities and command and data- 
processing software were not always efficient, and 
early attention was not given to overall system 
design in laying out missions. Further, experience 
with past and current spaceflight missions has 
shown that complicated systems with higher levels 
of intelligence are difficult to handle without 
substantial experience. 

We are apprehensive about recommending that 
radical new approaches be utilized without further 
study; nonetheless, it appears that some significant 
changes must be considered. Recognizing that mis- 
sion operations is the key to the success of any 
complicated undertaking, we therefore recommend 
that an assessment of mission operations, including 
spacecraft control and scientific instrument and 
data management and the design and management 
of software control systems, be studied by the 
Agency at the earliest possible time and the evalua- 
tion be presented to the Committee. 

The Federal Data Processing Reorganization Project has 
indicated serious failings in virtually all government agencies 
in the utilization of modern computer technology. While the 
National Science Foundation and the Advanced Research 
Project Agency (ARP A) of the Department of Defense con- 
tinue to support some work in machine intelligence and 
robotics, this work, especially that supported by ARP A, is 
becoming more and more mission-oriented. The amount of 
fundamental research supported by these agencies in machine 
intelligence and robotics is quite small. Because of its mission, 
NASA is uniquely ’suitable as the lead civilian agency in the 
federal government for the development of frontier technology 
in computer science, machine intelligence, and robotics. 
NASA’s general engineering competence and ability to carry 
out complex missions is widely noted and admired. These are 
just the capabilities needed by any federal agency designated 
to develop these fields. Although we are hardly experts on 
federal budgetary deliberations, it seems to us possible that 
incremental funds might be made available to NASA, over and 
above the usual NASA budget, if NASA were to make a com- 
pelling case for becoming the lead agency in the development 
of frontier technology in computer science and applications. 


53 


The beneficial impact of such a step for the industrial economy, 
for other branches of government, for the public well-being' 
and for NASA’s own future effectiveness in an era of tight 
budgets is likely to be substantial. 

We, the NASA Study Group, here state our overall con- 
clusions and recommendations. Our report is complete with 
supporting documentation leading to these conclusions and 
recommendations. 


A. Conclusions 

Conclusion 1. NASA is 5 to 15 years behind the leading edge 

in computer science and technology. 

There are some examples of excellence, but in general 
we find NASA’s use of computer technology disappointing. 
NASA installations still employ punched-card-based batch 
processing and obsolete machine languages. There is no 
NASA nationwide computer network and no widespread 
time-sharing use of computers. Although Viking was a 
brilliant technological success, given its design limitations, 
Viking’s use of robotics technology and in situ program- 
ming was rudimentary. These techniques must be greatly 
advanced for the complex missions of the future, both 
planetary and Earth orbital. Most Earth-satellite and much 
planetary exploration imaging data remains unanalyzed 
because of the absence of automated systems capable of 
performing content analyses. Even missions being planned 
for the 1980s are being designed almost exclusively for 
traditional data collection with little apparent provision 
being made for automated extraction of content infor- 
mation. 


Conclusion 2. Technology decisions are, to much too great a 
degree, dictated by specific mission goals, powerfully impeding 
NASA utilization of modem computer science and technology. 
Unlike its pioneering work in other areas of science and tech- 
nology, NASA ’s use of computer science and machine intelli- 
gence has been conservative and unimaginative. 


Strict funding limitations and an understandable aversion 
to mission failure cause mission directors to settle for 
proven but obsolete and, ironically, often very expensive 
technologies and systems. As machine intelligence and 
robotics continue to advance outside of NASA, the conse- 
quences of these traditions for higher cost and less efficient 
data return and analysis become more glaring. The inertial 
fixation on 15-year-old technologies, including slow pro- 
cessors and very limited memories, strongly inhibit NASA 


contact with and validation of advanced machine intelli- 
gence techniques. Flight minicomputer memories are typ- 
ically at 16,000 or 21,000 words, enormously restricting 
options. (For example, a very large number of scientific 
targets on Jupiter and the Galilean satellites, which other-_ 
wise could be acquired, had to be abandoned because of- 
the memory limitations of the Voyager onboard computer.) 
But million byte memories are now routinely employed and, 
once space-qualified, could provide enormous flexibility. 

Because of the long lead times in the planning cycle, many 
decisions relating to computers are made five to seven years 
before launch. Often, the computer technology involved is 
badly obsolete at the time hardware is frozen. Further, no 
deliberate effort is made to provide flexibility for software 
developments in the long time interval before mission 
operations. (Uplinking mission programs after launch is a 
small but significant step in the right direction.) 


Conclusion 3. The overall importance of machine intelligence 
and robotics for NASA has not been widely appreciated 
within the agency, and NASA has made no serious effort to 
attract bright, young scientists in these fields. 

In 1978/1979, the Space Systems and Technology Advisory 
Committee of the NASA Advisory Council had 40 mem- 
bers. Not one was a computer scientist, although two had 
peripherally related interests. Few, if any, of the best com- * 
puter science PhDs from the leading academic institutions 
in the field work for NASA. There is a looped causality 
with NASA’s general backwardness in computer science 
(Conclusion 1): An improvement of the quality of computer 
science at NASA cannot be accomplished without high 
quality professionals; but such professionals cannot be 
attracted without up-to-date facilities and the mandate to 
work at the leading edge of the field. 

The problems summarized in Conclusions 1 and 3 cannot 
be solved separately. 


Conclusion 4. The advances and developments in machine 
intelligence and robotics needed to make future space missions' 
economical and feasible will not happen without a major long- 
term commitment and centralized, coordinated support. 

A table of various planned future space missions and an 
estimate of technology development efforts needed to 
automate their system functions was given in Section IV 
(see Table 4-1). Without these automatic system functions, 
many of the missions will not be economically and/or 
technologically feasible. 
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B. Recommendations 

Recommendation 1. NASA should adopt a policy of vigorous 
and imaginative research in computer science, machine intelli- 
gence, and robotics in support of broad NASA objectives. 

The problems summarized in the preceding list of conclu- 
sions have solutions. They require, most of all, an awareness 
that the problems exist and a commitment of resources to 
solve them. Table 6-1 gives the published R&D budgets of 
the seven largest computer corporations in the United 
States. In all cases, the total R&D spending is greater than 
42% of total profits. The advanced R&D budget would be 
only a fraction of this amount. Leading corporations in 
' computer science and technology characteristically spend 
5.4 percent of gross earnings on relevant research and devel- 
opment. The same percentage of NASA’s annual expendi- 
ture in computer-related activities would suggest an annual 
NASA budget for research in computer science, machine 
intelligence, and robotics approaching one hundred million 
dollars. An expenditure of half that would equal the com- 
bined annual budget for this field for ARPA and the 
National Science Foundation. If NASA were selected as 
lead agency (or lead civilian agency) for federal research 
and development in computer science and technology, 
such amounts might not be at all impractical. Any signifi- 
cant expenditures should have detectable benefits in three 
to five years, and very dramatic improvements in NASA 
programs in 10 years. If NASA were to play such a lead 
agency role, one of its responsibilities would be to study 
the long-term implications for individuals and for society of 
major advances in machine intelligence and robotics. 

Recommendation 2. NASA should introduce advanced com- 
puter science technology to its Earth orbital and planetary 


missions, and should emphasize research programs with a 
multimission focus. 

A balance is needed onboard NASA spacecraft between 
distributed microprocessors and a centralized computer. 
Although function-directed distribution of processors 
might be useful, such architectures should not preclude the 
use of these computing resources for unanticipated needs. 
Distributed computer concepts emphasizing “fail-safe” 
performance should receive increased attention. For exam- 
ple, in the case of failure of a computer chip or a unit, a 
long-term goal is to effect migration of the program and 
data to other working parts of the systems. Such fail-safe 
systems require innovative architectures yet to be devel- 
oped. Dynamically reconfigurable processors with large 
redundancy are badly needed in NASA. 

NASA relies on 256-bit computer memory chips; 16,000 
bit and 64,000 bit chips are currently available. A million- 
bit chip is expected to be available within a few years. The 
cost of space-qualification of computer hardware may be 
very high, but the possibility exists that high information- 
density chips may already work acceptably in the space 
environment. We recommend that NASA perform space 
qualification tests on the Shuttle of multiple batches of 
existing microprocessors and memory chips. 

These two examples of developments in computer science 
and technology will have applications to many NASA mis- 
sions. We also recommend a transitional period in space- 
craft computer system design in which existing minipro- 
cessors and new microprocessors are both utilized, the 
former as a conservative guarantor of reliability, the latter 
as an aperture to the future. 


Table 6-1. R&D of the Big Seven Computer Companies 





R&D EXPENSE 


Company 

1977 Sales 
in millions 
of dollars 

1977 Profits 
in millions 
of dollars 

Actual 
in millions 
of dollars 

As a percent 
of Sales 

As a percent 
of Profits 

Cost of 
Employees 
in millions 
of dollars 

IBM 

Sperry Rand 

Honeywell 

NCR 

Burroughs 
Control Data 
Digital Equipment 

18,133 

3,270 

2,911 

2,522 

2,901 

1,493 

1,059 

2,719 

157 

134 

144 

215 

62 

109 

1,142 

168 

152 

118 

122 

73 

80 

6.3 

5.1 

5.2 
4.7 

4.2 
4.9 
7.6 

42 

107 

113 

82 

57 

118 

73 

3682 

1965 

2009 

1845 

2386 

1592 

2218 

Composite 

32,289 

3,540 

1,855 

5.4 

85 

2242 


* 
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In planetary exploration, . . it is clear . . . that more 
advanced mission techniques and instrumentation are 
required to fulfill the science strategy and achieve the 
objectives...” of intensive study of a planet. 8 Surface 
rovers and return-sample missions will be required to meet 
the science goals for Mars, the Galilean satellites of Jupiter, 
Titan, and perhaps Venus, as well as for investigation of 
such specific locations on the lunar surface as putative 
volatile-rich deposits at permanently shaded regions of the 
poles. With the exception of the Lunakhod and other 
Luna-class missions of the Soviet Union, there is little 
experience with such systems. Because of the long lead 
times and the complex nature of rover missions, they pro- 
vide an ideal testing ground for the implementation of the 
multimission focus of some of our recommendations. 

Recommendation 3. Mission objectives should be designed 
flexibly to take advantage of existing and likely future tech - 
nological opportunities. 

Hardware should be designed to exploit state-of-the-art 
software and likely near-future software developments. 
Adoption of this recommendation implies a careful re- 
examination of missions currently in the planning stages. 
This recommendation applies not only to spacecraft systems 
but to ground-based computer systems as. well. The man/ 
machine interface, both in Shuttle systems and in mission 
operations ground equipment, has not, in our opinion, 
been optimized. In routine mission operations, particularly 
in mission crisis management, there is a severe short-term 
competition for human attention and intellectual resources. 
The problem is a combinatorial one, requiring systematic 
and exhaustive failure-mode analysis, which can be opti- 
mally provided by computer systems, via a probability 
analysis, analogous to existing computer programs in medi- 
cal diagnosis. In addition to their value in crisis manage- 
ment, such computer systems will lead to the optimization 
of subsequent missions. 

Recommendation 4. NASA should adopt the following plan 
of action: 

(a) Establish a focus for computer science and technology 
at NASA Headquarters for coordinating R&D activities. 

The pace of advance in computer science and tech- 
nology is so great that even experts in the field have 
difficulty keeping up with advances and fully utilizing 
them. The problem is, of course, much more severe for 
those who are not experts in the field. By establishing 


6 Ibid, p. 39. 


a program in computer sciences, NASA can ensure that 
there is a rapid transfer of new technology to NASA 
programs. Space exploration offers a unique environ- 
ment in which to develop and test advanced concepts 
in this discipline. 

This leads to the following specific recommendation: 
NASA should consider Computer Science and Tech- 
nology sufficiently vital to its goals to treat the subject 
as an independent area of study. The specific concerns 
of this field, enumerated below, should become research 
and technology issues within NASA on the same basis 
as propulsion technology, materials science, planetary 
science, atmospheric physics, etc. This means the 
creation of a discipline office for computer science 
with interests in the major subdisciplines of the field 
and with appropriate contacts within NASA. A suitable 
budget and program of research and technology grants 
and contracts would provide the focus in this field the 
Study Group has found lacking in NASA. On the one 
hand, it would help make the outstanding workers in 
the field aware of and interested in serving NASA’s 
needs. Graduate students participating in such "a 
research program would become a source of future 
employees for NASA centers and contractors. On the 
other hand, it would provide NASA Headquarters 
with a better awareness of the potential contributions 
of computer science to its programs. To be effective, 
the initial operating budget of such a program should 
not be below 10 million dollars a year, with a long-term 
commitment for at least a constant level of funding in 
real dollars. 

Most of the fundamental research under such a program 
would be carried out at universities and at appropriate 
NASA centers. Collaboration with industry should be 
encouraged to expedite technology transfer. To meet 
the emerging mission requirements, parallel advanced 
development programs within all of NASA’s mission 
offices are required. 

Following is a list of problem areas that should set 
some goals for both the basic science research program 
and the advanced development effort: 

• Smart sensing; automated content analysis; stereo* 
mapping for eventual Earth and planetary applica- 
tions. 

• Manipulator design, particularly for autonomous 
use, including structures and effectors, force and 
touch detectors. 


56 



• Control and feedback systems, particularly those 
relevant to manipulation and teleoperator develop- 
ment. 

• Spacecraft crisis analysis systems. 

• Locomotion systems, particularly legged locomotion 
for difficult terrain. 

• Attempts at space qualification of multiple batches 
of existing microprocessors and memory chips. 

• Preliminary studies of automatic and teleoperator 
assembly of large structures for Earth orbital, lunar, 
and asteroidal environments. 

• Vision systems, particularly for use in locomotion 
and automated assembly. 

• Control and reasoning systems, particularly in 
support of lunar and planetary rovers. 

• Computer architectures for space systems. 

• Software tools for space system development. 

• Algorithm analysis for critical space-related 
problems. 

• Computer networks and computer-aided telecon- 
ferencing. (See paragraph (d) below.) 

The current university-based support from NSF and 
ARPA in computer science and machine intelligence 
is about 15 million dollars each annually. The level of 
university funding recommended here would be larger 
by about 30 percent, allowing NASA to compete effec- 
tively for the best talent and ideas. Parallel programs 
conducted by NASA program offices, which would be 
based strongly at NASA centers and industry, would 
approximately double the support requirement. The 
total support might eventually approach the 100 mil- 
lion dollar level, if NASA were seriously to pursue a 
broad program of research in computer science. 

(b) Augment the advisory structure of NASA by adding 
computer scientists to implement the foregoing 
recommendations. 

NASA is far enough behind the leading edge of the 
computer science field that major improvements in 
its operations can be made immediately using existing 


computer science systems and techniques such as 
modern data abstraction languages, time-sharing, inte- 
grated program development environments, and larger 
virtual memory computers (especially for onboard 
processing). Such general improvements in sophistica- 
tion are almost a prerequisite for a later utilization of 
machine intelligence and robotics in NASA activities. 
The advisory organizations should help plan and 
coordinate NASA’s effort in the field and establish 
contacts with the centers of computer science research. 

(c) Because of the connection of the Defense Mapping 
Agency’s (DMA) Pilot Digital Operations Project with 
NASA interests, NASA should maintain appropriate 
liaison. 

DMA has studied the advanced techniques in computer 
science with an emphasis on machine intelligence. 
There may be a strong relationship between many 
DMA concerns and related issues in NASA, particu- 
larly in scene analysis and understanding, large data- 
base management, and information retrieval. An 
evaluation by NASA of the DMA planning process 
associated with the DMA Pilot Digital Operations 
Project should aid in estimating the costs of NASA’s 
development in this field. 


(d) NASA should form a task group to examine the 
desirability, feasibility, and general specification of an 
all-digital, text-handling, intelligent communication 

system. 

A significant amount of NASA’s budget is spent in 
the transfer of information among a very complex, 
geographically, and institutionally disparate set of 
groups that need to exchange messages, ideas, require- 
ments, and documents quickly to keep informed, to 
plan activities, and to arrive at decisions. 

Based on a rough estimate, we predict that such an 
all-digital network would lead to significant improve- 
ments over the present method of carrying out these 
functions. In addition to the cost savings, there would 
be improvements in performance. Although it would 
not eliminate the use of paper and meetings as a means 
of communication, it would save tons of paper and 
millions of man-miles of energy-consuming travel. This 
system would facilitate and improve the participation 
of scientists in all phases of missions as well as enhance 
their ability to extract the most value from postmission 
data analysis. 
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The implementation of such a system would not be 
predicated on new developments in artificial intelli- 
gence, but on the tools that are in common use at 
artificial intelligence nodes of the ARPA network and 
are part of the developing technology of digital infor- 
mation and word processing. If such a development 
were carried out, it would provide the data base for 
sophisticated techniques, as they become available, 
for information retrieval, semantic search, and decision 


making, and a model for other public and private 
organizations, scientific, technological, and industrial. 


The task group to investigate this development shou'd 
include elements of NASA management, mission plan- 
ning and operations, scientific investigators, and 
information scientists, as well as specialists in artificial 
intelligence. 
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Biographical Sketches 

Dr. Carl Sagan is Director of the Laboratory for Planetary 
Studies, and David Duncan Professor of Astronomy and Space 
Sciences at Cornell University. His principal research activities are 
in the physics and chemistry of planetary atmospheres and sur- 
faces, space-vehicle exploration of the planets, the origin of life 
on Earth and the search for life elsewhere. Dr. Sagan played a 
leading role in the Mariner, Viking and Voyager missions to the 
planets, for which he received the NASA Medals for Exceptional 
Scientific Achievement and for Distinguished Public Service, and 
the international astronautics prize, the Prix Galabert. He has 
served as Chairman of the Division for Planetary Sciences of the 
American Astronomical Society, as Chairman of the Astronomy 
Section of the American Association for the Advancement of 
Science, and as President of the Planetology Section of the 
American Geophysical Union. For twelve years he was Editor-in- 
Chief of ICARUS: International Journal of Solar Systems 
Studies, the leading professional magazine devoted to planetary 
research. He is a Fellow of the American Academy of Arts and 
Sciences and a member of the Presidential Commission on a Na- 
tional Agenda for the 1980’s. In addition to 400 published scien- 
tific and popular articles. Dr. Sagan is author, co-author or editor 
of more than a dozen books including, Intelligent Life in the 
Universe (1966); The Cosmic Connection (1973); The Dragons of 
Eden ( 1977 ), for which he was awarded the Pulitzer Prize; Mur- 
murs of Earth (1978); and, Broca's Brain (1979). In 1975, he 
received the Joseph Priestley Award “for distinguished contribu- 
tions to the welfare of mankind.” 

Dr. Raj Reddy is professor of computer science at Carnegie- 
Mellon University, Pittsburgh, Pennsylvania. Prior to joining the 
Carnegie-Mellon faculty in 1969, he was an assistant professor of 
computer science at Stanford University, and also was an applied 
science representative with the IBM World Trade Corporation. 
He received a Ph.D. degree in computer science from Stanford in 
1966, having previously attended the University of Madras, India, 
and the University of New South Wales, Australia. His research 
interests in computer science are in the areas of artificial in- 
telligence and man-machine communication. In particular, Dr. 
Reddy is working on speech input to computers, visual input to 
computers, graphics, and task-oriented computer architectures. 
He is on the editorial boards of Artificial Intelligence, Image Pro- 
cessing and Computer Graphics, Cognitive Science, and IEEE 
Transactions on Pattern Analysis and Machine Intelligence. 

Dr. Ewald Heer is a technical manager in the Office of 
Technology and Space Program Development at the Jet Propul- 
sion Laboratory (JPL) leading the technology development pro- 
grams for autonomous systems and space mechanics. After 
receiving a Dr. Engr. Sc. degree, specializing in system engineer- 
ing, he conducted ■ and managed several research and advanced 
development projects at McDonnell Douglas Corporation, 
General Electric Space Science Laboratory, and JPL. On assign- 


or future teleoperator and robot technology. He organized inter- 
national conferences on Remotely Manned Systems at Caltech in 
1972 and at USC in 1975. In addition to publishing technical ar- 
ticles on systems theory, teleoperatory, and robotics, he is 
robotics editor of the Journal of Mechanisms and Machine 
Theory and has edited two books. Since 1973, he is also 
associated with the University of Southern California teaching 
operations research and planning as an adjunct professor of in- 
dustrial and systems engineering. 

Dr. James S. Albus is project manager for sensors and computer 
control in the automation technology program of the National 
Engineering Laboratory of the National Bureau of Standards. He 
has received the Department of Commerce Silver Award for his 
work in control theory and manipulator design and the Industrial 
Research IR-100 Award for his work in brain modeling and com- 
puter design. Before joining the Bureau of Standards he designed 
attitude measurement systems for NASA spacecraft and for a 
short period was program manager of the NASA artificial in- 
telligence program. 

Dr. Robert M. Balzer attended Carnegie Institute of Tech- 
nology under a George Westinghouse scholarship and a National 
Science Foundation fellowship, where he received his B.S., M.S., 
and Ph.D. degrees in electrical engineering in 1964, 1965, and 
1966 respectively. He joined the RAND Corporation in June 1966 
where he was concerned with reducing the effort required to 
utilize computers for problem solving, especially in on-line en- 
vironment. In April 1972, he left RAND to help form the 
USC/Information Sciences Institute. He is currently an associate 
professor of computer science and project leader of the Specifica- 
tion Acquisition From Experts (SAFE) project. This project is at- 
tempting to aid users to compose precise and correct program 
specifications from informal natural language descriptions by 
resolving the ambiguity present through context provided by the 
system’s knowledge of programs, programming, and the applica- 
tion domain. 

Dr. Thomas O. Binford, a research associate in the Artificial 
Intelligence Laboratory of the Department of Computer Science 
at Stanford University, is presently working in the area of com- 
puter visions and robotics. Dr. Binford was at the AI lab at MIT 
previously. The Ph.D. received by Dr. Binford is from the 
University of Wisconsin. 

Dr. R. C. Gonzalez is professor of electrical engineering and 
computer science at the University of Tennessee, Knoxville, 
where he is also director of the Image and Patter Analysis 
laboratory. He is an associate editor of the International 
Journal of Computer and Information Sciences and is a con- 
sultant to government and industry, such as the Oak Ridge 
National Laboratory, NASA, the U.S. Army, and the Martin 
Marietta Corporation. Dr. Gonzalez is co-author of three 
books on pattern recognition and image processing. In 1978 
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he received a UTK Chancellor’s Research Scholar Award for 
his work in these fields. 

Dr. Peter E. Hart is the director of the Artificial Intelligence 
Center at SRI International, which is doing research on experi- 
mental automation and is developing expert consultation 
systems. Other professional experience of Dr. Hart has been as 
a lecturer, computer science department, Stanford University 
and as a staff engineer at the Philco Western Development 
Laboratory. His academic background, BEE (1962) Rensselaer 
Polytechnic Institute, MS (1963), and PhD (1966) in electrical 
engineering, Stanford University. Dr. Hart is a coauthor of 
Pattern Classification and Scene Analysis, John Wiley & Sons, 
Inc. (1973) and has published 16 articles in, for example, Proc. 
Int. Joint Conf. on Artificial Intelligence, Commun. ACM, 
Artificial Intelligence, IEEE Trans. Sys. Sci & Cybernetics. 
Professional associations and honors of Dr. Hart are American 
Association for the Advancement of Science; Association for 
Computing Machinery; past chairman. Bay Area Chapter of 
IEEE Group on Systems Science and Cybernetics; Eta Kappa 
Nu; Sigma Xi; Tau Beta Pi; editorial board, Current Contents. 

Dr. John Hill is a staff member of the Artificial Intelligence 
Center at SRI International, Menlo Park, California. His back- 
ground includes work as a research engineer for Stanford 
Research Institute in teleoperator research and as charge’ de 
researchne for the Biomechanics Research Laboratory in 
France in prosthetics control. He received a BSEE (1961) and 
a MSEE (1963) degree from the University of Illinois, Urbana, 
and a PhD in electrical engineering (1967) from Stanford 
University. His research interests are design and control of 
robot devices for automatic assembly and advances in indus- 
trial automation. Dr. Hill is a member of the Robot Institute 
of America and the Society of Manufacturing Engineers. 

Mr. B. Gentry Lee is the manager of mission operations and 
engineering for the Galileo project at the Jet Propulsion Labo- 
ratory, Pasadena, California. Mr. Lee’s education is as follows: 
BA, summa cum laude. University of Texas, January 1963, Phi 
Beta Kappa at age 19, undergraduate studies in languages, liter- 
ature, mathematics; MS, mathematics, physics and aerospace, 
MIT, June 1964; Woodrow Wilson fellow at MIT, and Marshall 
fellow at University of Glasgow, Scotland, 1964-65. Profes- 
sionally Mr. Lee has been as follows: Aerospace engineer, 
Martin Marietta Corporation, 1965-1975. Director of science’ 
analysis and mission planning for the Viking flight team in 
Pasadena, California. This executive position involved the 
operational management of all 200 scientists and mission plan- 
ners associated with the first landing on the planet Mars. 
Earlier Viking management positions included mission opera- 
tions manager and navigation manager. Aerospace manager, Jet 
Propulsion Laboratory, 1975-1978. His first JPL position was 


manager of Mission Design Section. Mr. Lee was responsible 
for top-level design of all U.S. lunar and interplanetary mis- 
sions. Mr. Lee’s current position is manager of Mission Opera- 
tions and Engineering for Project Galileo, which will be an 
in-depth investigation of Jupiter and its moons during the mid- 
die of the 80s decade. 

Dr. Elliott C. Levinthal is adjunct professor and director of 
the Instrumentation Research Laboratory Department of 
Genetics, Stanford University School of Medicine. Previously 
he was associate dean for research affairs, Stanford University 
School of Medicine. He received a PhD degree from Stanford 
and holds degrees from Columbia and MIT. He is a principal 
investigator on the Viking 1975 Lander Imaging Science team 
and deputy team leader. In 1977 he received the NASA Public 
Service Medal for “Exceptional Contribution to the Success of 
the Viking Project.” He has served for several years as a con- 
sultant to NASA and was co-investigator on the Mariner Mars 
1971 photo interpretation team. Dr. Levinthal’s research inter- . 
ests include medical electronics, exobiology, application of 
computers to image processing, and medical information sys- 
tems. He is a member of the American Association for the 
Advancement of Science, American Physical Society, IEEE, 
Sigma Xi, Optical Society of America, and the Biomedical 
Engineering Society. 

Dr. Jack Minker is a professor and chairman of computer 
science at the University of Maryland, College Park, Mary- 
land. Prior to joining the faculty at the University of Mary- 
land, he served as acting office manager and technical director 
of the Auerbach Corporation’s Washington office, and as 
manager of information systems technology at RCA. He 
received a PhD degree in mathematics from the University of 
Pennsylvania in 1959 and previously received an MS in 
mathematics from Brooklyn College. His research interests 
in computer science are in artificial intelligence, automatic 
theorem proving, and database systems. He is a member of 
the Association for Computing Machinery, SIAM, and IEEE. 

He is on the editorial boards of Information Systems and 
Encyclopedia of Library and Information Science. 

Dr. Marvin Minsky occupies the chair of Donner professor 
of science at MIT. He was founder and director of the MIT 
Artificial Intelligence Laboratory until appointing Dr. Winston 
to the position. Dr. Minsky has played a central role in the - 
scientific design of many of today’s research programs in 
robotics and artificial intelligence. He is author of books and 
papers on the subjects of artificial intelligence, theory of 
computational complexity, cognitive psychology, and physi- 
cal optics. For outstanding contributions to computer science, 
he received the ACM’s Turing Award. He is a member of the 
National Academy of Sciences. 
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Dr. Donald A. Norman is a professor of psychology at the 
University of California at San Diego where he is also director 
of the program in cognitive science in the Center for Human 
Information Processing. From 1974 to 1978, he was chair of 
the Department of Psychology. His research interests concen- 
trate on understanding the psychological mechanisms of 
human cognition, with emphasis on problems in attention and 
memory. Dr. Norman received a BS degree from MIT and a MS 
degree from the University of Pennsylvania, both in electrical 
engineering. His doctorate, from the University of Pennsyl- 
vania, is in psychology. He has published in journals and 
books, and is the author or editor of four books. He is on the 
editorial boards of Cognitive Psychology , Journal of Cognitive 
Science, and the Journal of Experimental Psychology . He is a 
fellow of the American Psychological Association and of the 
American Association for the Advancement of Science. 

Dr. Charles J. Rieger received the BS degree in mathematics 
and computer science from Purdue University, Lafayette, 
Indiana, in 1970 and the PhD from Stanford University, Palo 
Alto, California in 1974. He is currently an associate professor 
in computer science at the University of Maryland, College 
Park, Maryland. His research interests are in the area of artifi- 
cial intelligence and cognitive modeling with particular empha- 
sis on models of human inference in language understanding. 

Dr. Thomas B. Sheridan is professor of engineering and 
applied psychology at MIT where his research is on man- 
machine control of aircraft, nuclear plants, and undersea tele- 
operators. He has served as visiting faculty member at the 
University of California at Berkeley, Stanford University, and 
the Delft University of Technology, the Netherlands. He is 
co-author of Man-Machine Systems (MIT Press, 1974) and 
Monitoring Behavior and Supervisory Control (Plenum, 1976). 
Formerly president of the IEEE Systems, Man and Cybernetics 
Society and editor of IEEE Transactions on Man-Machine 
Systems, he presently serves on the NASA Life Sciences Advi- 
sory Committee and the Congressional Office of Technology 
Assessment Task Force on Appropriate Technology. He is a 
fellow of the Human Factors Society and a recipient of the 
Society’s Paul M. Fitts award. 

Dr. William M. Whitney is the section manager for informa- 
tion systems research at the Jet Propulsion Laboratory in 
Pasadena, California. He also has a part-time assignment in the 
Office of Technology and Space Program Development to plan 
activities that will strengthen JPL in the technologies underly- 
ing its information-system applications. He received a BS 
degree from Caltech in 1951 in physics, and a PhD from MIT 
in 1956 in experimental low-temperature physics. He served as 
instructor and as assistant professor in the MIT physics depart- 
ment from 1956 until August 1963, when he joined JPL to 
conduct research in the guidance and control section. In 1967, 


he was appointed manager of that section, which became the 
forerunner of the present information systems research sec- 
tion. He led the planning that culminated in the creation of 
JPL’s robotics research program in 1972, and served as its 
technical leader until 1978. He was editor and a principal 
author of the section on information management in “A Fore- 
cast of Space Technology, 1980 — 2000,” prepared by JPL as 
a part of a broad “Outlook for Space” study conducted by 
NASA in 1974-75. Dr. Whitney has taken part in studies to set 
new directions for JPL and NASA research and development 
efforts. 

Dr. Patrick H. Winston is an associate professor of com- 
puter science at the Massachusetts Institute of Technology, 
where he is also director of the Artificial Intelligence Labora- 
tory. He is the editor of The Psychology of Computer Vision 
and the co-editor of Artificial Intelligence: An MIT Perspec- 
tive, as well as author of the textbook Introduction to Artifi- 
cial Intelligence , Addison-Wesley; New York. His research 
focuses on the subject of making computers learn. 

Dr. Stephen Yerazunis is associate dean of engineering and 
professor of chemical engineering at Rensselaer Polytechnic 
Institute, Troy, New York. He received a doctorate in chemi- 
cal engineering from Rensselaer Polytechnic Institute. His 
earlier research interests included vapor-liquid equilibria and 
heat and mass transfer phenomena at high mass transfer rates 
in turbulent flows. His current research involves appraisal of 
electrical energy alternatives for the State of New York and 
guidance of an autonomous rover for unmanned planetary ex- 
ploration. His interest in the latter is directed towards the 
development of short-range hazard detection and avoidance 
systems, as well as the configuration of the rover. He has been 
a consultant to the Knolls Atomic Power Laboratory and the 
General Electric Company. He is a fellow of the American 
Institute of Chemical Engineers. 

Dr. William B. Gevarter is in charge of NASA’s Space Guid- 
ance and Control, and artificial intelligence and robotics re- 
search programs. Previously he was with NASA Headquarters 
Office of Policy Analysis where he carried out research and 
analysis on the interaction of technology and society. He 
received a PhD degree in engineering from Stanford University, 
specializing in modem control theory. He has served as vice 
chairman of the American Society for Cybernetics and as 
chapter chairman of the Systems, Man and Cybernetics Soci- 
ety, Washington, D.C. He is a member of the American Insti- 
tute of Aeronautics and Astronautics, the World Future 
Society, the Society for General Systems Research, and the 
Association for Humanistic Psychology. 

Stanley R. Sadin is program manager of Space Systems 
Studies and Planning at NASA Headquarters. 


A-3 


NASA Study Group on Machine Intelligence and Robotics - Workshop I 

University of Maryland 

June 27-29, 1977 


Speaker 


Subject 


Stanley R. Sadin 
Daniel H. Herman 
King S. Fu 

Samuel W. McCandless 
Tom 0. Binford 
David Blanchard 
Richard C. Henry 
Harold B. Alsberg 
J. H. von Puttkamer 
Douglas A. Gilstad 
William L. Smith 
Dan C. Popma 
Leonard Friedman 
Lester K. Fero 
Simon V. Manson 
William B. Gevarter 
Stephen Yerazunis 
Charles J. Rieger 
Berthold Horn 


Program Overview 
Planetary Exploration 
Pattern Recognition 

Global Resources and Earth Observation 
Scene Analysis 

Flight Operations and Mission Control 
Search for Extraterrestrial Intelligence 
End-to-End Data Management 
Space Industrialization * 

Large Area Space Structures 
Remotely Operated Systems Development 
Teleoperator Supporting Research and Technology 
Robotic Tool Systems 

Space Transportation Systems and Associated Ground Operations 
Space Power 

NASA’s Robotics Program r 

Locomotion and Navigation of Roving Vehicles 
Communicating with Machines 

Machine Intelligence and Robotics - Prospects for Practical Applications 


NASA Study Group on Machine Intelligence and Robotics - Workshop IIA 

Jet Propulsion Laboratory 
September 28-29, 1977 


Speaker 


Subject 


Henry W. Norris 
Arden L. Albee 
Victor C. Clarke 
James R. French 
Henry W. Norris 
Marvin Minsky 
Boris M. Dobrotin 
Marvin Minsky 
William Whitney 
James D. Burke 
Garrett Paine 
Berthold Horn 
Robert B. McGhee 
William Whitney 
Charles J. Rieger 
George P. Textor 
James S. Albus 
Charles J. Rieger 


Results of Mars 1984 Mission Study 

Science Goals Achieved by a Rover 

Mission Design - Character of the 1984 Opportunity 

Rover System Description 

Project Overview - Alternate System Description, Costs, Schedule 
Manipulation for Planetary Surface Rovers 

Manipulation and Sensing Requirements as a Function of Science Objectives 

Summation and Discussions 

Locomotion for Planetary Surface Rovers 

Locomotion for Planetary Surface Rovers 

Locomotion for Planetary Surface Rovers 

Locomotion for Planetary Surface Rovers 

Locomotion for Planetary Surface Rovers 

Summation and Discussions 

Operations for Planetary Surface Rovers 

Operations for Planetary Surface Rovers 

Operations for Planetary Surface Rovers 

Summation and Discussions 


NASA Study Group on Machine Intelligence and Robotics - Workshop IIB 

Jet Propulsion Laboratory 

September 30, 1977 


Subject 

Algirdas A. Avizienis 
David A. Rennels 
Herbert Hecht 
Danny Cohen 
William M. Whitney 
Samuel Fuller 
Richard Greenblatt 
Marvin Minsky 
Justin Ratner 
Carver A. Mead 
Michael Ebersole 
Alan Perlis 
Ivan Sutherland 
Carver A. Mead 
Algirdas A. Avizienis 


Speaker 

Architectures for S/C Computers - Introduction and Overview 

Architectures for S/C Computers — S/C Dist. Computer Architectures 

Architectures for S/C Computers — Centralized Satellite Computer 

Architectures for S/C Computers — Discussions 

Architectures for S/C Computers - Discussions 

Trends in Computer Architectures - Multiprocessor Architectures 

Trends in Computer Architectures - LISP Processor 

Trends in Computer Architectures - Discussions 

Trends in LSI Technology - New Directions in MOS Technology 

Trends in LSI Technology - Designing in LSI 

Panel Discussion 

Panel Discussion 

Panel Discussion 

Panel Discussion 

Panel Discussion 


NASA Study Group on Machine Intelligence and Robotics - Workshop III 

Goddard Space Flight Center 

November 30, 1977 


Speaker 

David Blanchard 
John B. Zegalia 
Richard des Jar dins 
Stephen R. McReynolds 
John Y. Sos 
John J. Quann 
Robert D. Chapman 
Press Rose 
Robert Balzer 
Leonard Friedman 
B. Gentry. Lee 
James Porter 
B. A. Claussen 
Harlan Mills 
Azriel Rosenfeld 
Nico Habermann 
Robert Balzer 
John V. Guttag 
Brian Smith 
Mary Shaw 
Warren Teitelman 
Allen Newell 
Donald A. Norman 
Thomas B. Sheridan 
Donald A. Norman 


Subject 

NASA Organizations and Project Development Programs 

A Typical NASA End-to-End Data System 

Mission Independent Ground Operation Systems 

Survey of NASA Applications of Advanced Automation 

Trends in Space Telemetry Data Processing 

Large Data Base Application Requirements 

Need of Space Lab Facility Class Instruments of the Future 

Payload Software Technology 

Report on MSFC Data Management Symposium 

Report on AIAA Computers in Aerospace Conference 

Mission Operations for Planetary Missions 

Viking Mission Operation Strategy 

Viking Lander Software 

System Development Methodology 

Spacial Data Bases: Problems and Prospects 

System Development Control 

Program Specification and Verification 

Aspects of Program Specifications 

KRL: Knowledge Representation Language 

ALPHARD: A Language for the Development of Structured Programs 

Interactive Development of Large Systems 

ZOG: An Iterative System for Exploring Large Knowledge Bases 

Powers and Limitations of the Human Brain, Mind, Storage 

Discussion 

Discussion 
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NASA Study Group on Machine Intelligence and Robotics - Workshop IV 

Johnson Space Center 

February 1-2, 1978 

Speaker Subject 


Brian O’Leary 

Earle M. Crum 

George F. von Tiesenhausen 

W. H. Steurer 

C. C. Kraft 

Allen J. Louviere 

Robert V. Powell 

Ted Carey 

Hugh J. Dudley 

George W. Smith 

William R. Ferrell 

Oliver Selfridge 

Donald A. Norman 

Thomas B. Sheridan 


The Mining, Delivery and Processing of Non-Terrestrial Materials in Space 

Lunar Resources Utilization for Space Construction 

Space Processing and Manufacturing 

Recovery of Lunar Metals for Terrestrial Consumption 

Discussions 

Attached Manipulators and Fabrication in Space 
Large Antenna Reflectors Deployment and Erection 
Geostationary Platform Studies and Teleoperators 
Fabrication in Space and Simulation of Fabrication Operations 
Teleoperator Control for Space Construction 
Human Performance and Man-Machine Allocation for Space Tasks 
Multilevel Exploration: Human and Computer Roles 
Human Information Processing 
Summary Remarks 


NASA Study Group on Machine Intelligence and Robotics - Workshop V 

NASA Headquarters 

• March 8-9, 1978 


Speaker 


Subject 


Donald Williams/ 

Robert Cunningham 
Jay M. Tenenbaum 
Phillip H. Swain 
Henry Cook 

Alex F. H. Goetz 
Thomas Young 
Charles Elachi 
Edward J. Groth 
James Cutts 
Raj Reddy 
David Schaeffer 
Graham Nudd 
Q. R. Mitchell 
B. R. Hunt 

V. Casler/Ivan Sutherland 
Berthold Horn 
David Milgram/ 

Azriel Rosenfeld 
Jay M. Tenenbaum 


Automated Scene Analysis for Space Systems 

Application of AI to Remote Sensing 

At the Frontiers of Earth Resources Image Processing 

DMA Applications of Automatic Cartography & Possible Requirements for Machine 
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STEPS TOWARD 
ARTIFICIAL INTELLIGENCE 


by Marvin Minsky 


Introduction 

A visitor to our planet might be puzzled about the role of computers in 
our technology. On the one hand, he would read and hear all about wonder- 
ful “mechanical brains” baffling their creators with prodigious intellectual 
performance. And he (or it) would be warned that these machines must be 
restrained, lest they overwhelm us by might, persuasion, or even by the 
revelation of truths too terrible to be borne. On the other hand, our 
visitor would find the machines being denounced, on all sides, ^ for their 
slavish obedience, unimaginative literal interpretations, and incapacity 
for innovation or initiative; in short, for their inhuman dullness. 

Our visitor might remain puzzled if he set out to find, and judge for 
himself, these monsters. For he would find only a few machines (mostly 
“general-purpose” computers, programmed for the moment to behave ac- 
cording to some specification) doing things that might claim any real 
intellectual status. Some would be proving mathematical theorems of rather 
undistinguished character. A few machines might be playing certain games, 
occasionally defeating their designers. Some might be distinguishing be- 
tween hand-printed letters. Is this enough to justify so much interest, let 
alone deep concern? I believe that it is; that we are on the threshold of 
an era that will be strongly influenced, and quite possibly dominated, by 
intelligent problem-solving machines. But our purpose is not to guess about 
what the future may bring; it is only to try to describe and explain what 
seem now to be our first steps toward the construction of “artificial in- 
telligence.” 
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Along with the development of general-purpose computers, the past 
few years have seen an increase in effort toward the discovery and 
mechanization of problem-solving processes. Quite a number of papers 
ave appeare describing theories or actual computer programs concerned 
with game playing, theorem proving, pattern recognition, and other do- 
mains which would seem to require some intelligence. The literature does 
not include any general discussion of the outstanding problems of this field 
In this article, an attempt will be made to separate out, analyze, and 
find the relations between some of these problems. Analysis will be sup- 
ported with enough examples from the literature to serve the introductory 
function of a review article, but there remains much relevant work not de- 
scribed here. This report is highly compressed, and therefore, cannot be- 
gin to discuss all these matters in the available space. 

There is, of course, no generally accepted theory of “intelligence”; the 
analysis is our own and may be controversial. We regret that'we cannot 
give full personal acknowledgments here— suffice it to say that we have 
discussed these matters with almost every one of the cited authors. 

It is convenient to divide the problems into five main areas: Search, 
Pattern Recognition, Learning, Planning, and Induction; these comprise 

the main divisions of the report. Let us summarize, the entire argument 
very briefly: 

A computer can do, in a sense, only what it is told to do. But even 
when we do not know exactly how to solve a certain problem, we mat 
program a machine to Search through some large space of solution at- 
tempts. Unfortunately, when we write a straightforward program for such 
asearch, we usually find the resulting process to be enormously inefficient. 
With Pattern Recognition techniques, efficiency can be greatly improved 
by restnctmg the machine to use its methods only on the kind of att^mot^ 
for which they are appropriate. And with Learning, efficiency is furthc* 
improve y directing Search in accord with earlier exneriences. By ac 
tually analyzing the situation, using what we call Planning methods, the 
machrne may obtain a really fundamental improvement by replacin'* th»- 
ongmally given Search by a much smaller, more appropriate exploration 
rinally, in the section on Induction, we consider some rather more °loba» 
concepts of how one might obtain intelligent machine behavior. 

I. The Problem of Search 1 

If, for a given problem, we have a means for checking a proposed solu- 
tion, then we can solve the problem by testing all possible answers. But this 

“ T 1 hCre and Widdy in thC 1!teratUre ’ mea - elated 
ZZTJ, perlormnnce - a noun it fa also used in regard to any 

method or «r.ck used to improve the efficiency of a problem-solving system A 
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always takes much too long to be of practical interest. Any device that can 
reduce this search may be of value. If we can detect relative improvement, 
then “hill-climbing” (Sec. I-B) may be feasible, but its use requires 
some structural knowledge of the search space. And unless this structure 
meets certain conditions, hill-climbing may do more harm than good 

When we talk of problem-solving in what follows we will usually sup- 
pose that all the problems to be solved are initially well defined (McCarthy, 
1956). By this we mean that with each problem we are given some sys- 
tematic way to decide when a proposed solution is acceptable. Most of 
the experimental work discussed here is concerned with such well-defined 
problems as are met in theorem-proving, or in games with precise rules for 

play and scoring. ' . . 

In one sense all such problems are trivial. For if there exists a solution 

to such a problem, that solution can be found eventually by any blind 
exhaustive process which searches through all possibilities. And it is usu- 
ally not difficult to mechanize or program such a search. 

But for any problem worthy of the name, the search through aU pos- 
sibilities will be too inefficient for practical use. And on the other hand 
systems like chess, or nontrivial parts of mathematics, are too complicated 
for complete analysis. Without complete analysis, there must always re- 
main some core of search, or “trial and error.” So we need to find tech- 
niques through which the results of incomplete analysis can be used to 
make the search more efficient. The necessity, for this is simply over- 
whelming: a search of all the paths through the game of checkers involve, 
some 10“ move choices (Samuel, 1959a), in chess, some 10-- (Shannon, 
in Newman, 1956). If we organized all the particles in our galaxy into 
some kind of parallel computer operating at the frequency of hard cosmic 
rays the latter computation would still take impossibly long; we canno 
expect improvements in “hardware” alone to solve all our problems 
Certainly we must use whatever we know in advance to guide the trial 
generator. And we must also be able to make use of results obtained along 

the way. 2 ’ 3 

“heuristic program,” to be considered successful, must work well on a variety of 
problems, Ld may often be excused if it fails on some. We often find « 
to introduce a heuristic method which happens to cause occasional I failures if ther. 
is an over-all improvement in performance. But imperfect methods are no < necc - 
sarily heuristic, nor vice versa. Hence “heuristic” should not be regarded as opposi- 
te “foolproof’; this has caused some confusion in the literature. • 

1 McCarthy (1956) has discussed the enumeration problem from a recursi 
function-theory point of view. This incomplete but suggestive paper *”*!£*’ JJJ 0 "’ 
other things, that “the enumeration of partial recursive functions should gi e .. 
early place to compositions of functions that have already appeared , 

/regard this as an important notion, especially in the light of Shannons res ' 
(1949) on two-terminal switching circuits — that the “average n-varia e swi i ■- 
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A. Relative Improvement, HiH-climbing, and Heuristic Connections 

A problem can hardly come to interest us if we have no background of 
information about it. We usually have some basis, however flimsy, for de- 
eding improvement; some trials will be judged more successful than others, 
uppose, for example, that we have a comparator which selects as the 
better, one from any pair of trial outcomes. Now the comparator cannot 
alone, serve to make a problem well defined. No goal is defined. But if 
the comparator-defined relation between trials is “transitive” (i.e, if A 
ominates B and B dominates C implies. that A dominates C), then we 

can at least define “progress,” and ask our machine, given a time limit, to 
do the best it can. 

But it is essential to observe that a comparator by itself, however 
shrewd, cannot alone give any improvement over exhaustive search The 
comparator gives us information about partial success, to be sure. But we 
need also some way of using this information to direct the pattern of 
search in promising directions; to select new trial points which are in some 
sense hke, or “similar to,” or “in the same direction as” those which 
have given the best previous results. To do this we need some additional 
structure on the search space. This structure need not bear much resem- 
blance to the ordinary spatial notion of direction, or that of distance, but 
it must somehow tie together points which are heuristically related. 

e will call such a structure a heuristic connection. We introduce this 
term for informal use only— that is why our definition is itself so informal. 
But we need it. Many publications have been marred by the misuse, for 
this purpose, of precise mathematical terms, e.g., metric and topological. 
The term connection,” with its variety of dictionary meanings, seems just 

e word to designate a relation without commitment as to the exact nature 
of the relation. 

An important and simple kind of heuristic connection is that defined 
w en a space has coordinates (or parameters) and there is also defined a 
numerical “success function” E which is a reasonably smooth function of 

the th C< d° rdinateS ’ HerC WC Can USe l0CaI optimization or Mi-climbing 


cSurtTSL? 0 " f V " C ° ntaC ‘ S - Th, ' S dl ' Sa5ter d0eS n0t usua, 'y strike -hen we 
eomTocV /, 8 • rgC machines * Prefab!/ because they are based on 
composition of funcnons already found useful. One should not overlook the 

S ( S, 0f Newe " (1955) * and “ discussion of .he minimaxing 

'^j, 95 , 2 and eSPeC T ia ‘ Iy in I956 ’ Mhh y has an excellent discussion of the search 
ttabdhv ” 2”r er ’ ^ . not ,. C ° nvinced of the usefulness of his notion of “ultra- 
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B. Hill-climbing 

Suppose that we are given a black-box machine with inputs A„ . ... , A, 
and an output E(A„ . . . , A„). We wish to maximize E by adjusting the 
input values. But we are not given any mathematical description of the 
function E; hence we cannot use differentiation or related methods. The 
obvious approach is to explore locally about a point, finding die direction 
of steepest ascent. One moves a certain distance in that direction and 
repeats the process until improvement ceases. If the hill- is smooth this 
may be done, approximately, by estimating the gradient component 
dE/d Ai separately for each coordinate A t . There are more sophisticate 
approaches (one may use noise added to each variable, and correlate the 
output with each input, see Fig. 1), but this is the general idea It is a 
fundamental technique, and we see it always in the background of far more 
complex systems. Heuristically, its great virtue is this: the sampling effort 
(for determining the direction of the gradient) grows, in a sense, on.v 
linearly with the number of parameters. So if we can solve, by sue a 
method, a certain kind of problem involving many parameters, then the 
addition of more parameters of the same kind ought not cause an in- 
ordinate increase in difficulty. We are particularly interested in problem- 
solving methods which can be so extended to more difficult problems. Alas, 
most interesting systems which involve combinational operations usually 
grow exponentially more difficult as we add variables. 

A great variety of hill-climbing systems have been studied under the 
names of “adaptive” or “self-optimizing” servomechanisms. 


From other U '$ 



Figure 1. “Multiple simultaneous optimizers” search for a (local) maximum value 

of Tome function E<\„ A.) of several parameters. Each unit If. ‘"Jpendcntly 

“jitters’* its parameter perhaps randomly, by adding a variation . t) o * 

mean value tn. The changes in the quantities 5. and E are correlated, and the 
is used to (slowly) change The filters are to move d-c components Th.s sunuN 
taneous technique, really a form of coherent detection, usually has an advantage 
over methods dealing separately and sequentially with each parameter. [ . 

discussion of “informative feedback” in Wiener (1948, pp. 13jft.).l 
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C. Troubles with Hill*climbing 

Obviously, the gradient-following hill-climber would be trapped if it should 
reach a local peak which is not a true or satisfactory optimum. It must 
then be forced to try larger steps or changes. 

It is often supposed that this false-peak problem is the chief obstacle to 
machine learning by this method. This certainly can be troublesome. But 
for really difficult problems, it seems to us that usually the more funda- 
mental problem lies in finding any significant peak at all. Unfortunately 
the known E functions for difficult problems often exhibit what we have 
called (Minsky and Selfridge, 1960) the “Mesa Phenomenon “ in which a 
small change in a parameter usually leads to either no change in per- 
formance or to a large change in performance. The space is thus com- 
posed primarily of flat regions or “mesas.” Any tendency of the trial 
generator to make small steps then results in much aimless wandering 
without compensating information gains. A profitable search in such a 
space requires steps so large that hill-climbing is essentially ruled out. The 
problem-solver must find other methods; hill-climbing might still be feasible 
with a different heuristic connection. 

Certainly, in our own intellectual behavior we rarely solve a tricky prob- 
lem by a steady climb toward success. I doubt that in any one simple 
mechanism, e.g., hill-climbing, will we 'find the means to build an efficient 
and general problem-solving machine. Probably, an intelligent machine 
wUl require a variety of different mechanisms. These will be arranged in 
hierarchies, and in even more complex, perhaps recursive, structures. And 
perhaps what amounts to straightforward hill-climbing on one level may 
sometimes appear (on a lower level) as the sudden jumps of “insight.” 

II. The Problem of Pattern Recognition 

In order not to try all possibilities, a resourceful machine must classify 
problem situations into categories associated with the domains of effective- 
ness of the machine’s different methods. These pattern-recognition methods 
must extract the heuristically significant, features of the objects in question. 
The simplest methods simply match the objects against standards or proto- 
types. More powerful “property-list” methods subject each object to a 
sequence of tests, each detecting some property of heuristic importance. 
These properties have to be invariant under commonly encountered forms 
of distortion. Two important problems arise here — inventing new useful 
properties, and combining many properties to form a recocnition svstem. 
For complex problems, such methods will have to be augmented by facilities 
for subdividing complex objects and describing the complex relations 
between their parts. 
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Any powerful heuristic program is bound to contain a variety of different 
methods and techniques. At each step of the problem-solving process the 
machine will have to decide what aspect of the problem to work on, and 
then which method to use. A choice must be made, for we usually cannot 
afford to try all the possibilities. In order to deal with a goal or a problem, 
that is, to choose an appropriate method, we have to recognize what kind 
of thing it is. Thus the need to choose among actions compels us to provide 
the machine with classification techniques, or means of evolving them. It is 
of overwhelming importance that the machine have classification techniques 
which are realistic. But “realistic” can be defined only with respect to the 
environments to be encountered by the- machine, and with respect to the 
methods available to it. Distinctions which cannot be exploited are not 
worth recognizing. And methods are usually worthless without classifica- 
tion schemes which can help decide when they are applicable. 

A. Teleological Requirements of Classification 

The useful classifications are those which match the goals and methods 
of the machine. The objects grouped together in the classifications should 
have something of heuristic value in common; they should be “similar in a 
useful sense; they should depend on relevant or essential features. We 
should not be surprised, then, to find ourselves using inverse or teleological 
expressions to define the classes. We really do want to have a grip on “the 
class of objects which can be transformed into a result of form Y,” that is, 
the class of objects which will satisfy some goal. One should be wary of 
the familiar injunction against using teleological language in science. While 
it is true that talking of goals in some contexts may dispose us towards 
certain kinds of animistic explanations, this need not be a bad thing in the 
field of problem-solving; it is hard to see how one can solve problems 
without thoughts of purposes. The real difficulty with teleological defini- 
tions is technical, not philosophical, and arises when they have to be used 
and not just mentioned. One obviously cannot afford to use for classifica- 
tion a method which actually requires waiting for some remote outcome, 
if one needs the classification precisely for deciding whether to try out that 
method. So, in practice, the ideal teleological definitions often have to be 
replaced by practical approximations, usually with some risk of error; 
that is, the definitions have to be made heuristically effective , or eco- 
nomically usable. This is of great importance. (We can think of “heuristic 
effectiveness” as contrasted to the ordinary mathematical notion of “effec- 
tiveness” which distinguishes those definitions which can be realized at all 
by machine, regardless of efficiency.) 

B. Patterns and Descriptions 

It is usually necessary to have ways of assigning names — symbolic expres- 
sions— to the defined classes. The structure of the names will have a 
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crucial influence on the mental world of the machine, for it determines 
what kinds of things can be conveniently thought about. There are a 
variety of ways to assign names. The simplest schemes use what we will 
call conventional (or proper ) names; here, arbitrary symbols are assigned 
to classes. But we will also want to use complex descriptions or computed 
names; these are constructed for classes by processes which depend on the 
class definitions. To be useful, these should reflect some of the structure 
of the things they designate, abstracted in a manner relevant to the problem 
area. The notion of description merges smoothly into the more complex 
notion of model; as we think of it, a model is a sort of active description. 
It is a thing whose form reflects some of the structure of the thing repre- 
sented, but which also has some of the character of a working machine. 

In Sec. Ill we will consider “learning” systems. The behavior of those 
systems can be made to change in reasonable ways depending on what 
happened to them in the past. But by themselves, the simple learning 
systems are useful only in recurrent situations; they cannot cope with any 
significant novelty. Nontrivial performance is obtained only when learning 
systems are supplemented with classification or pattern-recognition methods 
of some inductive ability. For the variety of objects encountered in a non- 
trivial search is so enormous that we cannot depend on recurrence, and 
the mere accumulation of records of past experience can have only limited 
value. Pattern Recognition, by providing a heuristic connection which 
links the old to the new, can make learning broadly useful. 

What is a “pattern”? We often use the term teleologically to mean a 
set of objects which can in some (useful) way be treated alike. For each 
problem area we must ask, “What patterns would be useful for a machine 
working on such problems?” 

The problems of visual pattern recognition have received much attention 
in recent years and most of our examples are from this area. 

C. Prototype-derived Patterns 

'nie problem of reading printed characters is a clearcut instance of a situa- 
tion in which the classification is based ultimately on a fixed set of “proto- 
types” e.g., the dies from which the type font was made. The individual 
marks on the printed page may show the results of many distortions. Some 
distortions are rather systematic; change in size, position, orientation. 
Some are of the nature of noise; blurring, grain, low contrast, etc. 

If the noise is not too severe, we may be able to manage the identifica- 
tion by what we call a normalization and template-matching process. We 
first remove the differences related to size and position— that is, we 
normalize the input figure. One may do this, for example, by constructing 
a similar figure inscribed in a certain fixed triangle (see Fin. 2); or one 
may transform the figure to obtain a certain fixed center of gravity and a 
unit second central moment. [There is an additional problem with rotational 
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equivalence where it is not easy to avoid all 
ambiguities. One does not want to equate “6' 1 
and “9.” For that matter, one does not want 
to equate (0,o), or (AT,x) or the 6 s in x 0 and 
so that there may be context dependency 
involved.] Once normalized, the unknown 
figure can be compared with templates for the 
prototypes and, by means of some measure of 
matching, choose the best fitting template. 
Each “matching criterion” will be sensitive to 
particular forms of noise and distortion, and 
so will each normalization procedure. The in- 
scribing or boxing method may be sensitive to 
small specks, while the moment method will 
be especially sensitive to smearing, at least for 
thin-line figures, etc. The choice of a matching 
criterion must depend on the kinds of noise 
and transformations commonly encountered. 
Still, for many problems we may get acceptable results by using straight- 
forward correlation methods. 

When the class of equivalence transformations is very large, e.g., when 
local stretching and distortion are present, there will be difficulty in finding 
a uniform normalization method. Instead, one may have to consider a 
process of adjusting locally for best fit to the template. (While measuring 
the matching, one could “jitter;’ the figure locally; if an improvement were 
found the process could be repeated using a slightly different change, etc.) 
There is usually no practical possibility of applying to the figure all of the 
admissible transformations. And to recognize the topological equivalence 
of pairs such as those in Fig. 3 is likely beyond any practical kind of itera- 
tive local-improvement or hill-climbing matching procedure. (Such recog- 
nitions can be mechanized, though, by methods which follow lines, detect 
vertices, and build up a description in the form, say,* of a vertex-connection 
table.) 


Figure 2. A simple normal- 
ization technique. If an ob- 
ject is expanded uniformly, 
without rotation, until it 
touches all three sides of a 
triangle, the resulting figure 
will be unique, aqd pattern 
recognition can proceed 
without concern about re- 
lative size and position. 



(<2) ( a 1 . W 


Figure 3. The figures A, A' and B. B' are topologically equivalent pairs. Lengths 
have been distorted in an arbitrary manner, but the connectivity relations between 
corresponding points have been preserved. In Sherman (1959) and Haller (1959) 
we find computer programs which can deal with such equivalences. 
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"Hie template-matching scheme, with its normalization and direct com- 
parison and matching criterion, is just too limited in conception to be of 
much use in more difficult problems. If the transformation set is large, 
normalization, or “fitting,” may be impractical, especially if there is no 
adequate heuristic connection on the space of transformations. Further- 
more, for each defined pattern, the system has to be presented with a proto- 
type. But if one has in mind a fairly abstract class, one may simply be 
unable to represent its essential features with one or a very few concrete 
examples. How could one represent with a single prototype the class of 
figures which have an even number of disconnected parts? Clearly, the 
template system has negligible descriptive power. The property-list system 
frees us from some of these limitations. 

D. Property Lists and “Characters” 

We define a property to be a two-valued function which divides figures 
into two classes; a figure is said to have or not have the property according 
to whether the function’s value is 1 or 0. Given a number N .of distinction 
properties, we could define as many as 2" subclasses by their set inter- 
sections and, hence, as many as 2 2 ' patterns by combining the properties 
with AND’s and OR’s. Thus, if we have three properties, rectilinear, con- 
nected, and cyclic, there are eight subclasses (and 256 patterns) defined 
by their intersections (see Fig. 4). 

If the given properties are placed in a fixed order then we can represent 
any of these elementary regions by a vector, or string of digits. The vector 
so assigned to each figure will be called the Character of that figure (with 
respect to- the sequence of properties in question). [In. “Some Aspects of 
Heuristic Programming and Artificial Intelligence” (1959a), we use the 
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term characteristic for a property without restriction to 2 values.] Thus a 
square has the Character (1,1,1) and a circle the Character (0,1,1) for 
the given sequence of properties. 

For many problems one can use such Characters as names for categories 
and as primitive elements with which to define an adequate set of patterns. 
Characters are more than conventional names. They are instead very 
rudimentary forms of description (having the form of the simplest sym- 
bolic expression — the list) whose structure provides some information 
about the designated classes. This is a step, albeit a small one, beyond the 
template method; the Characters are not simple instances of the patterns, 
and the properties may themselves be very abstract. Finding a good set of 
properties is the major concern of many heuristic programs. 

E. Invariant Properties 

One of the prime requirements of a good property is that it be invariant 
under the commonly encountered equivalence transformations. Thus for 
visual Pattern Recognition we would usually want the object identification 
to be independent of uniform changes in size and position. In their pioneer- 
ing paper Pitts and McCulloch (1947) describe a general technique for 
forming invariant properties from noninvariant ones, assuming that the 
transformation space has a certain (group) structure. The idea behind 
their mathematical argument is this: suppose that we have a function P 
of figures, and suppose that for a given figure F we define [F] = (Fi,F 2 , 

. . . } to be the set of all figures equivalent to F under the given set of 
transformations; further, define P[F] to be the set {P(F X ),P(F 2 ), . . . } 
of values of P on those figures. Finally, define P*[F] to be AVERAGE 
(P[F]). Then we have a new property P* whose values are independent 
of the selection of F from an equivalence class defined by the transforma- 
tions. We have to be sure that when different representatives are chosen 
from a class the collection [F] will always be the same in each case. In the 
case of continuous transformation spaces, there will have to be a measure 
or the equivalent associated with the set [F] with respect to which the 
operation AVERAGE is defined, say, as an integration. 4 

This method is proposed (Pitts and McCulloch, 1947) as a neuro- 
physiological model for pitch-invariant hearing and size-invariant visual 

4 In the case studied in Pitts and McCulloch (1947) the transformation space is a 
group with a uniquely defined measure: the set [F] can be computed without repeti- 
tions by scanning through the application of all the transforms T « to the given figure 
so that the invariant property can be defined by 

P*(F) = |^ 0 P(r a CD)dM 

where G is the group and n the measure. By substituting T*j(F) for F in this, one 
can see that the result is independent of choice of p since we obtain the same 
integral over Gp' 1 » G. 
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recognition (supplemented with visual centering mechanisms). This model 
is discussed also by Wiener. 5 Practical application is probably limited to 
one-dimensional groups and analog scanning devices. 

In much recent work this problem is avoided by using properties already 
invariant under these transformations. Thus a property might count the 
number of connected components in a picture — this is invariant under 
size and position. Or a property may count the number of vertical lines in 
a picture — this is invariant under size and position (but not rotation). 

F. Generating Properties 

The problem of generating useful properties has been discussed by 
Selfridge (1955); we shall summarize his approach. The machine is given, 
at the start, a few basic transformations A u . . . , A „, each of which 
transforms, in some significant way, each figure into another figure. A t 
mi Sht, for example, remove all points not on a boundary of a solid region; 
A 2 might leave only vertex points; A 3 might fill up hollow regions , etc. (see 
Fig. 5). Each sequence AuAij . . . A n of these forms a new trans- 
formation, so that there is available an infinite variety. We provide the 
machine also with one or more ‘‘terminal” operations which convert a 
picture into a number, so that any sequence of the elementary transforma- 
tions, followed by a terminal operation, defines a property. [Dineen (1955) 
describes how these processes were programmed in a digital computer.] 
We can start with a few short sequences, perhaps chosen randomly. 
Selfridge describes how the machine might leam new useful properties. 

We now feed the machine A’s and O’s telling the machine each time 
which letter it is. Beside each sequence under the two letters, the 
machine builds up distribution functions from the results of applying 
the sequences to the image. Now, since the sequences were chosen 
completely randomly, it may well be that most of the sequences have 
very flat distribution functions; that is, they [provide ] no information, 
•See pp. 160ff. of Wiener (1948). 
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Figure 5. An arbitrary sequence of picture transformations, followed by a numerical- 
valued function, can be used as a property function for pictures. A x removes all 

points which are not at the edge of a solid region. A* leaves only vertex points 

at which an arc suddenly changes direction. The function C simply counts the 
number of points remaining in the picture. All remarks in the text could be 
generalized to apply to properties like AtA^C, which can have more than two values. 
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and the sequences are therefore [by definition ] not significant . Let it 
discard these and pick some others . Sooner or later , however, some 
sequences will prove significant; that is, their distribution functions 
will peak up somewhere . What the machine does now is to build up 
new sequences like the significant ones . This is the important point . 
If it merely chose sequences at random it might take a very long 
while indeed to find the best sequences . But with some successful 
sequences, or partly successful ones, to guide it, we hope that the 
process will be much quicker . The crucial question remains: how do 
we build up sequences “like” other sequences, but not identical? As of 
now we think we shall merely build sequences from the transition 
frequencies of the significant sequences. We shall build up a matrix 
of transition frequencies from the significant ones, and use those as 
transition probabilities with which to choose new sequences. 

We do not claim that this method is necessarily a very good way 
of choosing sequences — onty that it should do better than not using 
at all the knowledge of what kind of sequences has worked. It has 
seemed to us that this is the crucial point of learning* 

It would indeed be remarkable if this failed to yield properties more 
useful than would be obtained from completely random sequence selection. 
The generating problem is discussed further in Minsky (l$56a). Newell, 
Shaw, and Simon (1960Z?) describe more deliberate, less statistical, tech- 
niques that might be used to discover sets of properties appropriate to a 
given problem area. One may think of the Selfridge proposal as a system 
which uses a finite-state language to describe its properties. Solomonoff 
(1957, 1960) proposes some techniques for discovering common features 
of a set of expressions, e.g., of the descriptions of those properties of 
already established utility; the methods can then be applied to generate 
new properties with the same common features. I consider the lines of 
attack in Selfridge (1955), Newell, Shaw and Simon (1960a), and 
Solomonoff (1960, 1958), although still incomplete, to be of the greatest 
importance. 

G. Combining Properties 

One cannot expect easily to find a small set of properties which will be 
just right for a problem area. It is usually much easier to find a large set 
of properties each of which provides a little useful information. Then one 
is faced with the problem of finding a way to combine them to make the 
desired distinctions. The simplest method is to choose, for each class, a 
typical character (a particular sequence of property values) and then to 
use some matching procedure, e.g., counting the numbers of agreements 
and disagreements, to compare an unknown with these chosen “Character 

•See p. 93 of Selfridge (19551. 
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prototypes.” The linear weighting scheme described just below is a slight 
generalization on this. Such methods treat the properties as more or less 
independent evidence for and against propositions; more general pro- 
cedures (about which we have yet little practical information) must ac- 
count also for nonlinear relations between properties, i.e., must contain 
weighting terms for joint subsets of property values. 

1. “BAYES nets” for combining independent properties 

We consider a single experiment in which an object is placed in front of a 
property-list machine. Each property E, will have a value, 0 or 1. Suppose 
that there has been defined some set of “object classes” Fy, and that we 
want to use the outcome of this experiment to decide in which of these 
classes the object belongs. 

Assume that the situation is basically probabilistic, and that we know 
the probability Pn that, if the object is in class Fy then the fth property E> 
will have value 1. Assume further that these properties are independent; 
that is, even given F h knowledge of the value of E ( tells us nothing more 1 
about the value of a different E k in the same experiment. (This is a strong 
condition— see below.) Let <f>, be the absolute probability that an object 
is in class Fy. Finally, for this experiment define V to be the particular set 
of fs for which the EC s are 1. Then this V represents the Character of 
the object. From the definition of conditional probability, we have 

Pr(F y ,F) = Pr(F) • Pr (F,|F) = Pr(Fy) • Pr(F|Fy) 

Given the Character V, we want to guess which F, has occurred (with the 
least chance of being wrong— the so-called maximum likelihood estimate); 
that is, for which / is Pr(FyiF) the largest? Since in the above Pr(F) does 
not depend on /, we have only to calcuate for which / is 

Pr(F,) • Pr(F|F y ) = <fcPr(F|Fy) 

the largest. Hence, by our independence hypothesis, we have to maximize 

** • n va • n ?.> - <t>i n ^ • n (d 

*=V ,<=K »Ui 

These “maximum-likelihood” decisions can be made (Fig. 6) by a simple 
network device. 7 

T At the cost of an additional network layer, we may also account for the possible 
cost gtu that would be incurred if we were to assign to F* a figure really in class F>; 
in this case the minimum cost decision is given by the k for which 

Y, n vu n 

i »ev *=/ 

is the least. V is the complement set to V. q, t is (1 — p u ). 
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Figure 6. “Net” model for maximum-likelihood decisions based on linear weightings 
of property values. The input data are examined by each “property filter” E,. 
Each Ei has “0” and “1" output channels, one of which is excited by each input. 
These outputs are weighted by the corresponding pit's, as shown in the text. 
The resulting signals are multiplied in the Ft units, each of which collects evidence 
for a particular figure class. [We could have used here log (pn), and added at 
the Ft units.] The final decision is made by the topmost unit D, who merely chooses 
that Ft with the largest score. Note that the logarithm of the coefficient Pu/qu 
in the second expression of ( 1 ) can be construed as the “weight of the evidence 
of Ei in favor of F,. [See also Papert (1961) and Rosenblatt (1958).] 

These nets resemble the general schematic diagrams proposed in the 
“Pandemonium” model of Selfridge (1959) (see his fig. 3). It is proposed 
there that some intellectual processes might be carried out by a hierarchy 
of simultaneously functioning submachines suggestively called “demons. 
Each unit is set to detect certain patterns in the activity of others and the 
output of each unit announces the degree of confidence of that unit that it 
sees what it is looking for. Our E\ units are Selfridge’s “data demons. 
Our units Fj are his “cognitive demons”; each collects from the abstracted 
data evidence for a specific proposition. The topmost “decision demon D 
responds to that one in the multitude below it whose shriek is the loudest/ 

It is quite easy to add to this “Bayes network model” a mechanism 
which will enable it to learn the optimal connection weightings. Imagine 
that, after each event, the machine is told which Fj has occurred; we 
could implement this by sending back a signal along the connections lead- 
ing to that Fj unit. Suppose that the connection for p i; - (of q a) contains 
a two-terminal device (or “synapse”) which stores a number w-,j. Whenever 
the joint event (F„ £ ( = 1) occurs, we modify by replacing it by 

* Sec also the report in Selfridge and Neisser ( I960). 
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(wh + 1)0* where 0 is a factor slightly less than unity. And when the 
joint event ( Fj , Ei = 0) occurs, we decrement w-,j by replacing it with 
(wu)0. It is not difficult to show that the expected values of the wj/s will 
become proportional to the Pi/s [and, in fact, approach Pi/[9/(l — 5)]. 
Hence, the machine tends to learn the optimal weighting on the basis of 
experience. (One must put in a similar mechanism for estimating the 
The variance of the normalized weight w, y [(l — 8) /&] approaches 
£0 ^)/(l 0]Pij(]ii> Thus a small value for 0 means rapid learning 

but is associated with a large variance, hence, with low reliability. Choosing 
9 close to unity means slow, but reliable, learning. 9 is really a sort of 
memory decay constant, and its choice must be determined by the noise 
and stability of the environment— much noise requires long averaging 
times, while a changing environment requires fast adaptation. The rivo 
requirements are, of course, incompatible and the decision has 'to be based 
on an economic compromise. 9 

2. POSSIBILITIES OF USING RANDOM NETS FOR BAYES DECISIONS 

The nets of Fig. 6 are very orderly in structure. Is all this structure neces- 
sary? Certainly if there were a great many properties,, each of which 
provided very little marginal information, some of them would not be 
missed. Then one might expect good results with a mere sampling of all 
the possible connection paths w tJ . And one might thus, in this special 
situation, use a random connection net. 

The two-layer nets here resemble those of the “Perceptron” proposal of 
Rosenblatt (1958). In the latter, there is an additional level of connections 
coming directly from randomly selected points of a “retina.” Here the 
properties, the devices which abstract, the visual input data, are simple 
functions which add some inputs, subtract others, and detect whether the 
result exceeds a threshold. Equation ( 1 ) , we think, illustrates what is of 
value in this scheme. It does seem clear that a maximum-likelihood type 
of analysis of the output of the property functions can be handled by such 
nets. But these nets, with their simple, randomly generated, connections 
can probably never achieve recognition of such patterns as “the -class of 
figures having two separated parts,” and ‘they cannot even achieve the effect 
of template recognition without size and position normalization (unless 
sample figures have been presented previously in essentially all sizes and 
positions). For the chances are extremely small of finding, by random 
methods, enough properties usefully correlated with patterns appreciably 
more abstract than those of the prototype-derived kind. And these net- 
works can really only separate out (by weighting) information in the in- 
dividual input properties; they cannot extract further information present 
in nonadditive form. The "Perceptron” class of machines have facilities 
neither for obtaining better-than-chance properties nor for assembling 

*Sce also Minsky and Selfridpe (I960), and Papcrt 1 1551). 
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better-than-additive combinations of those it gets from random construc- 
tion. 10 

For recognizing normalized printed or hand-printed characters, single- 
point properties do surprisingly well (Highleyman and Kamentsky, 1960); 
this amounts to just “averaging” many samples. Bledsoe and Browning 
(1959) claim good results with point-pair properties. Roberts (1960) 
describes a series of experiments in this general area. Doyle (1959) with- 
out normalization but with quite sophisticated properties obtains excellent 
results; his properties are already substantially size- 2 nd position-invariant. 
A general review of Doyle’s work and other pattern-recognition experi- 
ments will be found in Selfridge and Neisser ( 1960) . 

For the complex discrimination, e.g., between one and two connected 
objects, the property problem is very serious, especially for long wiggly 
objects such as are handled by Kirsch (1957). Here some kind of recursive 
processing is required and combinations of simple properties would almost 
certainly fail even with large nets and long training. 

We should not leave the discussion of some decision' net models without 
noting their important limitations. The hypothesis that, for given /, the p-,, 
represent independent events, is a very strong condition indeed. Without 
this hypothesis we could still construct maximum-likelihood nets, but we 
would need an additional layer of cells to represent all of the joint events 
V ; that is, we would need to know all the Pr(F ; |F). This gives a general 
(but trivial) solution, but requires 2 n cells for n properties, which is com- 
pletely impractical for large systems. What is required is a system which 
computes some sampling of all the joint conditional probabilities, and uses 
these to estimate others when needed. The work of Uttley (1956, 1959) 
bears on this problem, but his proposed and experimental devices do not 
yet clearly show how to avoid exponential growth. 11 

H. Articulation and Attention— Limitations of the Property-list Method 

Because of its fixed size, the property-list scheme is limited (for any given 
set of properties) in the detail of the distinctions it can make. Its ability to 
deal with a compound scene containing several objects is critically weak, 
and its direct extensions are unwieldy and unnatural. If a machine can 
recognize a chair and a table, it surely should be able to tell us that “there 
is a chair and a table.” To an extent, we can invent properties which allow 
some capacity for superposition of object Characters. 11 But there is no wa) 
to escape the information limit. 

"See also Roberts (1960), Papert (1961), and Hawkins (1958). We can find 
nothing resembling an analysis [see (1) above] in Rosenblatt (1958) or his sub- 
sequent publications. 

11 See also Papert (1961). 

u Cf. Mooers’ technique of Zatocoding (1956a, 19563). 
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Figure 7. The picture (a) is first described verbally in the text. Then, by introducing 
notation for the relations “inside of," "to the left of and “above,” we construct 
a symbolic description. Such descriptions can be formed and manipulated by 
machines. By abstracting out of the complex relation between the parts of the figure 
we can use the same formula to describe the related pictures ( b ) and (c), changing 
only the list of primitive parts. It is up to the programmer to decide at just what 
level of complexity a part of a picture should be considered “primitive"; this will 
depend on what the description is to be used for. We could further divide the 
drawings into vertices, lines, and arcs. Obviously, for some applications the relations 
would need more metrical information, e.g., specification of lengths or angles. 


What is required is clearly (1) a list (of whatever length is necessary) 
of the primitive objects in the scene and (2) a statement about the rela- 
tions among them. Thus we say of Fig. la, “A rectangle (1) contains two 
subfigures disposed horizontally. The part on the left is a rectangle (2) - 
which contains two subfigures disposed vertically; the upper a circle (3) and 
the lower a triangle (4). The part on the right . . . etc.” Such a descrip- 
tion entails an ability to separate or “articulate” the scene into parts. (Note 
that in this example the articulation is essentially recursive; the figure is 
first divided into two parts; then each part is described using the same 
machinery.) We can formalize this kind of description in an expression 
language whose fundamental grammatical form is a pair (R,L) whose 
first member R names a relation and whose second member L is an ordered 
list (x lr x 3 , . . . r r n ) of the objects or subfigures which bear that relation 
to one another. We obtain the required flexibility by allowing the members 
of the list L to contain not only the names of “elementary” figures but also 
subexpressions” of the form ( R,L ) designating complex subfigures. Then 
our scene above may be described by the expression 

[O, (□, (-», {(©, (□, (i, (O, A)))), (O, (O, (V, (O, O, O))))}))] 

where (q, (x,y)) means that y is contained in x; (~>,(x,y) ) means that 
y is to the right of x; (j,(x,y)) means-that >• is below x, and (A.(*.y,z)) 
means that y is to the right of x and z is underneath and between them. 
The symbols □, o. and A represent the indicated kinds of primitive 
geometric objects. This expression-pair description language may be re- 
garded as a simple kind of “list-structure” language. Powerful computer 
techniques have been developed, originally by Newell, Shaw and Simon, 
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for manipulating symbolic expressions in such languages for purposes of 
heuristic programming. (See the remarks at the end of Sec. IV. If some of 
the members of a list are themselves lists, they must be surrounded by 
exterior parentheses, and this accounts for the accumulation of paren- 
theses.) 

It may be desirable to construct descriptions in which the complex 
relation is extracted, e.g., so that we have an expression of the form FC 
where F is an expression which at once denotes the composite relation be- 
tween all the primitive parts listed in G. A complication arises in connec- 
tion with the “binding” of variables, i.e., in specifying the manner in 
which the elements of G participate in the relation F , This can be hindled 
in general by the “A” notation (McCarthy, 1960) but here we can just use 
integers to order the variables. 

For the given example, we could describe the relational part F by an 
expression 

O (1,— K© (2,i(3,4)), G (5, V (6,7,8)))) 

in which we now use a “functional notation”; “( ©, (*00 )’ ‘ s replaced by 
“O (_x,y ) ,” etc., making for better readability. To obtain the desired 
description, this expression has to be applied to an ordered list of primitive 
objects, which in this case is ( □,0,0,'A,0,0.0,0)‘ composite 
functional form allows us to abstract the composite relation. By changing 
only the object list we can obtain descriptions also of the objects in Fig. 
lb and c. 

The important thing about such “articular” descriptions is that they can 
be obtained by repeated application of a fixed set of pattern-recognition 
techniques. Thus we can obtain arbitrarily complex descriptions from a 
fixed complexity classification mechanism. The new element required in 
the mechanism (beside the capacity to manipulate the list structures) is 
the ability to articulate — to “attend fully” to a selected part of the picture 
and bring all one’s resources to bear on that part. In efficient problem- 
solving programs, we will not usually complete Such a description in a 
single operation. Instead, the depth or detail of description will be undei 
the control of other processes. These will reach deeper, or look more 
carefully, only when they have to, e.g., when the presently available descrip- 
tion is inadequate for a current goal. The author, together with L. Hode>. 
is working on pattern-recognition schemes using articular descriptions. B\ 
manipulating the formal descriptions we can deal with overlapping ar.d 
incomplete figures, and several other problems of the ‘ Gestalt type. 

It seems likely that as machines are turned toward more difficult prob- 
lem areas, passive classification systems will become less adequate, and 
we may have to turn toward schemes which are based more on internally 
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generated hypotheses, perhaps “error-controlled” along the lines proposed 
byMacKay (1956). 

Space requires us to terminate this discussion of pattern-recognition and 
description. Among the important works not reviewed here should be 
mentioned those of Bomba (1959) and Grimsdale et al. (1959), which 
involve elements of description, Unger (1959) and Holland (1960) for 
parallel processing schemes, Hebb (1949) who is concerned with physio- 
logical description models, and the work of the Gestalt psychologists, 
notably Kohler (1947), who have certainly raised, if not solved, a num- 
ber of important questions. Sherman. (1959), Haller (1959) and others 
have completed programs using line-tracing operations for topological 
classification. The papers of Selfridge (1955, 1956) have been a major 
influence on work in this general area. • 

See also Kirsch et al. (1957) for discussion of a number of interesting 
computer image processing techniques, and see Minot (1959) and Stevens 
(1957) for reviews of the reading machine and related problems. One 
should also examine some biological work, e.g., Tinbergen (1951) to see 
instances in which some discriminations which seem, at first glance very 
complicated are explained on the basis of a few apparently simple prop- 
erties arranged in simple decision trees. 

III. Learning Systems 

In order to solve a new problem, one should first try using methods 
similar to those that have worked on similar problems. To implement this 
basic learning heuristic ’ one must generalize on past experience, and 
one way to do this is to use success-reinforced decision models. These 
learning systems are shown to be averaging devices. Using devices which 
learn also which events are associated with reinforcement, i.e., reward, 
we can build more autonomous “secondary reinforcement” systems. In 
applying such methods to complex problems, one encounters a serious 
difficulty in distributing credit for success of a complex strategy among 
the many decisions that were involved. This problem can be managed by 
arranging for local reinforcement of partial goals within a hierarchy, and 
by grading the training sequence of problems to parallel a process of 
maturation of the machine’s resources. 

In order to solve a new problem one uses what might be called the basic 
learning heuristic — first try using methods similar to those which have 
worked, in the past, on similar problems. We want our machines, too, to 
benefit from their past experience. Since we cannot expect new situations 
to be precisely the same as old ones, any useful learning will have to in- 
volve generalization techniques. There are too many notions associated 
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with “learning” to justify defining the term precisely. But we may be sure 
that any useful learning system will have to use records of the past as 
evidence for more general propositions,' it must thus entail some commit- 
ment or other about “inductive inference.” (See Sec. VB.) Perhaps the 
simplest way of generalizing about a set of entities is through constructing 
a new one which is an “ideal,” or rather, a typical member of that set; the 
usual way to do this is to smooth away variation by some sort of averaging 
technique. And indeed we find that most of the simple learning devices do 
incorporate some averaging technique — often that of averaging some sort 
of product, thus obtaining a sort of correlation. We shall discuss this 
family of devices here, and some more abstract schemes in Sec. V. 

A. Reinforcement 

A reinforcement process is one in which some aspects of the behavior of a 
system are caused to become more (or less) prominent in the future as 
a consequence of the application of a “reinforcement operator Z, This 
operator is required to affect only those aspects of behavior for which 
instances have actually occurred recently. 

The analogy is with “reward” or “extinction” (not punishment) in ani- 
mal behavior. The important thing about this kind of process is that it is 
“operant” [a tenr of Skinner (1953)]; the reinforcement operator does 
not initiate behavior, but merely selects that which the Trainer likes from 
that which has occurred. Such a system must then contain a device M 
which generates a variety of behavior (say, in interacting with some en- 
vironment) and a Trainer who makes critical judgments in applying the 
available reinforcement operators. (See Fig. 8.) 

Let us consider a very simple reinforcement model. Suppose that on 
each presentation of a stimulus S an animal has to make a choice, e,g,, to 
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Figure 8. Parts of an “operant reinforcement” learning system. In response to a 
stimulus from the environment, the machine makes one of several possible responses. 
It remembers what decisions were made in choosing this response. Shortly there- 
after, the Trainer sends to the machine positive or negative reinforcement (reward) 
signal; this increases or decreases the tendency to make the same decisions in the 
future. Note that the Trainer need not know how to solve problems, but only how 
to detect success or failure, or relative improvement; his function is selective. The 
Trainer might be connected to observe the actual stimulus-response activity, or. 
in a more interesting kind of system, just some function of the state of the 
environment. 
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turn left or right, and that its probability of turning right, at the nth trial, 
is p n . Suppose that we want it to turn right. Whenever it does this we 
might “reward” it by applying the operator Z + ; 


p„+i = Z+(P*) - 6p n + (l — e) 0 < 9 < 1 

which moves p a fraction (1 — 6) of the way toward unity. 13 If we dislike 
what it does we apply negative reinforcement, 


P»-n = Z_(p„) = 9p n 

moving p the same fraction of the way toward 0. Some theory of such 
linear learning operators, generalized to several stimuli and responses, 
will be found in Bush and Mosteller (1955). We can show that the learn- 
ing result is an average weighted by an exponentially-decaying time factor: 
Let Z n be ±1 according to whether the nth event is rewarded or extin- 
guished and replace p n by c„ = 2p„ — 1 so that — 1 < c n < 1, as for a 
correlation coefficient. Then (with c* = 0) we obtain by induction 

n 

C „+1 = (1 - 0) £ 0"-% 

t-0 


and since 




Qn-% 


we can write this as 


Ca+1 




a) 


If the term Z\ is regarded as a product of (i) how the creature responded 
and (ii) which kind of reinforcement was given, then c n is a kind of cor- 
relation function (with the decay weighting) of the joint behavior of these 
quantities. The ordinary, uniformly weighted average has the same general 
form but with time-dependent 6\ 


y ) Ca : y (2) 

In (1) we have again the situation described in Sec. IIG1; a small value 
of 9 gives fast learning, and the possibility of quick adaptation to a chang- 
ing environment. A near-unity value of $ gives slow learning, but also 
smooths away uncertainties due to noise. As noted in Sec. IIG1, the re- 
sponse distribution comes to approximate the probabilities of rewards of 
the alternative responses, (The importance of this phenomenon has, I 
think, been overrated; it is certainly not an especially rational stratcav. 
One reasonable alternative is that of computing the numbers pa as indi- 

Properly, the reinforcement functions should depend both on the /?* s and on the 
previous reaction — reward should decrease p if our animal has just turned to the left. 
The notation in the literature is also somewhat confusing in this regard. 
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cated, but actually playing at each trial the “most likely” choice. Except 
in the presence of a hostile opponent, there is usually no reason to play a 
“mixed” strategy. 1 *) 

In Samuel’s coefficient-optimizing program (19596) [see Sec. IIIC1], 
there is a most ingenious compromise between the exponential and the 
uniform averaging methods: the value of N in (2) above begins at 16 and 
so remains until n = 16, then N is 32 until n = 32, and so on until n = 
256. Thereafter N remains fixed at 256. This nicely prevents violent fluc- 
tuations in c n at the start, approaches the uniform weighting for a while, 
and finally approaches the exponentially weighted correlation, all in a 
manner that requires very little computation effort! Samuel s program is at 
present the outstanding example of a game-playing program which matches 
average human ability, and its success (in real time) is attributed to a 
wealth of such elegancies, both in heuristics and in programming. 

The problem of extinction or “unlearning” is especially critical for com- 
plex, hierarchical, learning. For, once a generalization about the past has 
been made, one is likely to build upon it. Thus, one may come to select 
certain properties as important and begin to use them in the characteriza- 
tion of experience, perhaps storing one’s memories in terms of them. If 
later it is discovered that some other properties would serve better, then 
one must face the problem of translating, or abandoning, the records based 
on the older system. This may be a very high price to pay. One does not 
easily give up an old way of looking at things, if the better one demands 
much effort and experience to be useful. Thus the training sequences on 
which our machines will spend their infancies, so to speak, must be chosen 
very shrewdly to insure that early abstractions will provide a good founda- 
tion for later difficult problems. 

Incidentally, in spite of the space given here for their exposition, I am 
not convinced that such “incremental” or “statistical” learning schemes 
should play a central role in our models. They will certainly continue to 
appear as components of our programs but, I think, mainly by default. 
The more intelligent one is, the more often he should be able to learn 
from an experience something rather definite; e.g., to reject or accept a 
hypothesis, or to change a goal. (The obvious exception is that of a truly 
statistical environment in which averaging is inescapable. But the heart of 
problem-solving is always, we think, the combinatorial part that gives rise 
to searches, and we should usually be able to regard the complexities 
caused by “noise” as mere annoyances, however irritating they may be.) 
In this connection we can refer to the discussion of memory in Miller, 
Galanter and Pribram (I960). 15 This seems to be the first major work 

M The question of just how often one should play a strategy different from the 
estimated optimum, in order to gain information, is an underlying problem in many 
fields. See, e.g., Shubik (1960). 

Ji Scc especially chap. 10. 
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in psychology to show the influence of work in the artificial intelligence 
area, and its programme is generally quite sophisticated. 

B. Secondary Reinforcement and Expectation Models 

The simple reinforcement system is limited by its dependence on the 
Trainer. If the Trainer can detect only the solution of a problem, then we 
may encounter “mesa” phenomena which will limit performance on diffi- 
cult problems. (See Sec. I C.) One way to escape this is to have the ma- 
chine learn to generalize on what the Trainer does. Then, in difficult prob- 
lems, it may be able to give itself partial reinforcements along the way, 
upon the solution of relevant subproblems. The machine in Fig. 9 
has some such ability. The new unit U is a device that leams which exter- 
nal stimuli are strongly correlated with the various reinforcement signals, 
and responds to such stimuli by reproducing the corresponding reinforce- 
ment signals. (The device U is not itself a reinforcement learning device; 
it is more like a “Pavlovian” conditioning device, treating the Z signals' 
as “unconditioned” stimuli and the S signals as conditioned stimuli.) The 
heuristic idea is that any signal from the environment which in the past 
has been well correlated with (say) positive reinforcement is likely to be 
an indication that something good has just happened. If the training on 
early problems was such that this is realistic, then the system eventually 
should be able to detach itself from the Trainer, and become autonomous. 
If we further permit “chaining” of the “secondary reinforcers,” e.g., by 
admitting the connection shown as a dotted line in Fig. 9, the scheme* be- 
comes quite powerful, in principle. There are obvious pitfalls in admitting 



Figure 9. An additional device U gives the machine of Fig. 8 the ability to learn 
which signals from the environment have been associated with reinforcement. 
The primary reinforcement signals Z are routed through U . By a PavJovian 
conditioning process (not described here), external signals come to produce rein- 
forcement signals like those that have frequently succeeded them in the past. 
Such signals ^ might be abstract, e.g., verbal encouragement. If the “secondary 
reinforcement” signals are allowed, in turn, to acquire further external associations 
(through, e.g., a channel Z L as shown) the machine might come to be able to 
handle chains of subproblems. But something must be done to stabilize the system 
against the positive symbolic feedback loop formed by the path Z v . The profound 
difficulty presented by this stabilization problem may be reflected in the fact that, 
in lower animals, it is very difficult to demonstrate such chaining effects. 
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such a degree of autonomy; the values of the system may drift to a “non- 
adaptive” condition. 

C. Prediction and Expectation 

The evaluation unit U is supposed to acquire an ability to tell whether a 
situation is good or bad. This evaluation could be applied to imaginary 
situations as well as to real ones. If we could estimate the consequences 
of a proposed action (without its actual execution), we could use U to 
evaluate the (estimated) resulting situation. This could help in reducing 
the effort in search, and we would have in effect a machine with some 
ability to look ahead, or plan. In order to do this we need an additional 
device P which, given the description of a situation and an action, will pre- 
dict a description of the likely result.- (We will discuss schemes for doing 
this in Sec. I VC.) The device P might be constructed along the lines of 
a reinforcement learning device. In such a system the required reinforce- 
ment signals would have a very attractive character. For the machine must 
reinforce P positively when the actual outcome resembles that which 
was predicted — accurate expectations are rewarded. If we could further 
add a premium to reinforcement of those predictions which have a novel 
aspect, we might expect to discern behavior motivated by a sort of 
curiosity. In the reinforcement of mechanisms for confirmed novel 
expectations (or new explanations) we may find the key to simulation of 
intellectual motivation. 1 ® 

SAMUEL’S PROGRAM FOR CHECKERS 

In Samuel’s “generalization learning” program for the game of checkers 
(1959a) we find a novel heuristic technique which could be regarded as a 
simple example of the “expectation reinforcement” notion. Let us review 
very briefly the situation in playing two-person board games of this kind. 
As noted by Shannon (1956) such games are in principle finite, and a 
best strategy can be found by following out all possible continuations— if 
he goes there I can go there, or there, etc.— and then “backing up” or 
“minimaxing” from the terminal positions, won, lost, or drawn. But in 
practice the full exploration of the resulting colossal “move tree” is out of 
the question. No doubt, some exploration will always be necessary for 
such games. But the tree must be pruned. We might simply put a limit on 
depth of exploration— the number of moves and replies. We might also 
limit the number of alternatives explored from each position — this requires 
some heuristics for selection of “plausible moves.” 11 Now, if the backing-up 
technique is still to be used (with the incomplete move tree) one has to 

“See also chap. 6 of Minsky (1954). . 

«See the discussion of Bernstein (1958) and the more extensive review and 
discussion in the very suggestive paper of Newell, Shaw and Simon (19586). 


1 


i 

j 

■ 'j 

\ STEPS TOWARD ARTIFICIAL INTELLIGENCE 431 



Mo* Min Max Min Mox 

Figure 10. “Backing up" the static evaluations of proposed moves in a game tree. 
From the vertex at the left, representing the present position in a board game, 
radiate three branches, representing the players proposed moves. Each of these 
might be countered by a variety of opponent moves, and so on. According to 
some program, a finite tree is generated. Then the worth to the player of each 
terminal board position is estimated (see text). If the opponent has the same 
values, he will choose to minimize the score, while the player will always try 
to maximize. The heavy lines show how this minimaxing process backs up until a 
choice is determined for the present position. 

The full tree for chess has the order of .10 1 * branches — beyond the reach of any 
man or computer. There is a fundamental heuristic exchange between the effectiveness 
o the evaluation function and the extent of the tree. A very weak evaluation 
one which just compares the players* values of pieces) would yield a 
devastating game if the machine could explore all continuations out to, say, 
20 levels. But only 6 levels, roughly within the range of our presently largest 
computers, would probably not give a brilliant game; less exhaustive strategies, 
perhaps along the lines of Newell, Shaw, and Simon (19586), would be more 
profitable. 

substitute for the absolute “win, lose, or draw” criterion some other 
“static” way of evaluating nonterminal positions. 18 (See Fig. 10.) Perhaps 
the simplest scheme is to use a weighted sum of some selected set of 
property” functions of the positions — mobility, advancement, center con- 
trol, and the like. This is done in Samuel’s program, and in most of- its 
predecessors. Associated with this is a multiple-simultaneous-optimizer 
method for discovering a good coefficient assignment (using the correlation 
technique noted in Sec. III/l). But the source of reinforcement signals in 

“In some problems the backing-up process can be handled in closed analytic 
form so that one may be able to use such methods as Bellman’s “Dynamic Pro- 
gramming” (1957). Freimer (1960) gives some examples for which limited “look- 
ahead" doesn’t work. 
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Samuel (1959a) is novel. One cannot afford to play out one or more 
entire games for each single learning step. Samuel measures instead for 
each move the difference between what the evaluation function yields di- 
rectly of a position and what it predicts on the basis of an extensive con- 
tinuation exploration, i.e., backing up. The sign of this error. Delta, is 
used for reinforcement; thus the system may learn something at each 
move. 19 

D. The Basic Credit-assignment Problem for Complex Reinforcement 
Learning Systems 

In playing a complex game such as chess or checkers, or in writing a com- 
puter program, one has a definite success criterion — the game is won or 
lost. But in the course of play, each ultimate success (or failure) is asso- 
ciated with a vast number of internal decisions. If the run is successful, 
how can we assign credit for the success among the multitude of decisions? 
As Newell noted, 

It is extremely doubtful whether there is enough information in ' win, 
lose, or draw” when referred to the whole play of the game to permit 
any learning at all over available time scales. . . . For learning to 
take place, each play of the game must yield much more information. 
This is .. . achieved by breaking the problem into components. 
The unit of success is the goal. If a goal is achieved, its sub goals are 
reinforced; if not they are inhibited. ( Actually , what is reinforced is 
the transformation rule that provided the sub goal.) . . . This also 
is true of the other kinds of structure: every tactic that is created 
provides information about the success or. failure of tactic search 
rules; every opponent’s action provides information about success or 
failure of likelihood inferences; and so on. The amount of informa- 
tion relevant to learning increases directly with the number of niecha- 
nisms in the chess-pUiying machines 0 

We are in complete agreement with Newell on this approach to the 

problem. 21 , „ • . . „ 

It is my impression that many workers in the area of self-organizing 

systems and “random neural nets” do not feel the urgency of this pro 

“It should be noted that Samuel (1959a) describes also a rather successful 
checker-playing program based on recording and retrieving information about posi- 
tions encountered in the past, a less abstract way of exploiting past expcriei^e. 
Samuel’s work is notable in the variety of experiments that were performed « ,th 
and without various heuristics. This gives an unusual opportunity to really find out 
how different heuristic methods compare. More workers should choose (other 
things being equal) problems for which such variations are practicable. 

“See p. 108 of Newell (1955). 

n See also the discussion in Samuel (p. 22, 1959a) on assigning credit for a change 
in “Delta.” 
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Iem. Suppose that one million decisions are involved in a complex task 
(such as winning a chess game). Could we assign to each decision element 
one-millionth of the credit for the completed task? In certain special situa- 
tions we can do just this — e.g., in the machines of Rosenblatt (1958), 
Roberts (1960), and Farley and Clark (1954), etc., where the connec- 
tions being reinforced are to a sufficient degree independent. But the 
problem-solving ability is correspondingly weak. 

For more complex problems, with decisions in hierarchies (rather than 
summed on the same level) and with increments small enough to assure 
probable convergence, the running times would become fantastic. For 
complex problems we will have to define “success” in some rich local 
sense. Some of the difficulty may be evaded by using carefully graded 

“training sequences” as described in the following section. 

friedberg’s program-writing program 

An important example of comparative failure in this credit-assignment 
matter is provided by the program of Friedberg (1958,- 1959) to solve 
program-writing problems. The problem here is to write programs for a 
(simulated) very simple digital computer. A simple problem is assigned, 
e -g; “compute the AND of two bits in storage and put the result in an 
assigned location.” A generating device produces a random (64-instruc- 
tion) program. The program is run and its success or failure is noted. 
The success information is used to reinforce individual instructions (in 
fixed locations) so that each success tends to increase the chance that the 
instructions of successful programs will appear in later trials. (We lack 
space for details of how this is done.) Thus the program tries to find 
good ’ instructions, more or less independently, for each location in pro- 
gram memory. The machine did learn to solve some extremely simple 
problems. But it took of the order of 1000 times lonser than pure chance 
would expect. In part II of Friedberg et al. (1959), this failure is dis- 
cussed, and^ attributed in part to what we called (Sec. I C) the “Mesa 
phenomena.” In changing just one instruction at a time the machine had 
not taken large enough steps in its search through program space. 

The second paper goes on to discuss a sequence of modifications in 
the program generator and its reinforcement operators. With these, and 
with some “priming” (starting the machine off on the right track with 
some useful instructions), the system came to be only a little worse than 
chance. Friedberg et al. (1959) conclude that with these improvements 
t e generally superior performance of those machines with a success- 
number reinforcement mechanism over those without does serve to indi- 
cate that such a mechanism can provide a basis for constructing a iearn- 
tng machine.” I disagree with this conclusion. It seems to me that each of 
e improvements” can be interpreted as serving only to increase the step 
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size of the search, that is, the randomness of the mechanism; this helps to 
avoid the Mesa phenomenon and thus approach chance behavior. But it 
certainly does not show that the “learning mechanism” is working— one 
would want at least to see some better-than-chance results before arguing 
this point. The trouble, it seems, is with credit-assignment. The credit for 
a working program can only be assigned to functional groups of instruc- 
tions, e.g., subroutines, and as these operate in hierarchies^ we should not 
expect individual instruction reinforcement to work well.*- It seems sur- 
prising that it was not recognized in Friedberg et al. (1959) that the 
doubts raised earlier were probably justified! In the last section of Fned- 
berg et al. (1959) we see some real success obtained by breaking the 
problem into parts and solving them sequentially. (This successful demon- 
stration using division into subproblems does not use any reinforcement 
mechanism at all.) Some experiments of similar nature are reported in 

Kilbum, Grimsdale and Sumner (1959). 

It is my conviction that no scheme for learning, or for pattern recogni- 
tion, can have very general utility unless there are provisions for recursive, 
or at least hierarchical, use of previous results. We cannot expect a learn- 
ing system to come to handle very hard problems without preparing it 
with a reasonably graded sequence of problems of growing difficulty. The 
first problem must be one which can be solved in reasonable time with the 
initial resources. The next must be capable of solution in reasonable time 
by using reasonably simple and accessible combinations of methods de- 
veloped in the first, and so on. The only alternatives to this use of an 
adequate “training sequence” are (1) advanced resources, given initia y, 
or (2) the fantastic exploratory processes found perhaps only in the his- 
tory of organic evolution.* 3 And even there, if we accept the general view 
of Darlington (1958) who emphasizes the heuristic aspects of genetic 
systems, we must have developed early (in, e.g., the phenomena of meiosis 
and crossing-over) quite highly specialized mechanisms providing for the 
segregation of groupings related to solutions of subproblems. Recently, 
much effort has been devoted to the construction of training sequences in 
connection with programming “teaching machines.” Naturally, the psycho- 
logical literature abounds with theories of how complex behavior is built 


» See the introduction to Friedberg (1958) for a thoughtful discussion of the 

plausibility of the scheme. • . . ... -* n 

a It should, however, be possible to construct learning mechanisms which can 
select for themselves reasonably good training sequences (from an always complex 
environment) by prearranging a relatively slow development (or maturation ) of 
the system’s facilities. This might be done by prearranging that sequence of goals 
attempted by the primary Trainer match reasonably well, at each stage, the com- 
plexity of performance mechanically available to the patiern-recognilion and other 
parts of the system. One might be able to do much of this by simply limiting the 
depth of hierarchical activitv. perhaps only later permitting limited recursive activity. 


* 
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u ?fr° m sim P Ier - In our own area, perhaps the work of Solomonoff 
(1957), while overly cryptic, shows the most thorough consideration of 
this dependency on training sequences . 


IV. Problem-solving and Planning 

The solution, by machine, of really complex problems will require a 
variety of administration facilities. During the course of solving a problem, 
one becomes involved with a large assembly of interrelated subproblems. 
From these, at each stage, a very few must be chosen for investigation. 
Inis decision must be based on (1) estimates of relative difficulties and 
(2) estimates of centrality of the different candidates for attention Fol- 
lowing subproblem selection (for which several heuristic methods are 
proposed), one must choose methods appropriate to the selected problems. 
But for really difficult problems, even these step-by-step heuristics for 
reducing search will fail, and the machine must have resources for analyz- 
ing the problem structure in the large — in short, for “planning.” A num- 
ber of schemes for planning are discussed, among them the use of models 
—analogous, semantic, and abstract. Certain abstract models, “Character- 
Algebras,” can be constructed by the machine itself, on the basis of ex- 
perience or analysis. For concreteness, the discussion begins with a descrip- 
tion of a simple but significant system (LT) which encounters some of 
these problems. 

A. The “Logic Theory” Program of Newell, Shaw and Simon 

It is not surprising that the testing grounds for early work on mechanical 
problem-solving have usually been areas of mathematics, or games, in 
which the rules are defined with absolute clarity. The “Logic Theory” 
machine of Newell and Simon (1956a, 1957a), called “LT” below was a 
first attempt to prove theorems in logic, by frankly heuristic methods. 
Although the program was not by human standards a brilliant success 
(and did not surpass its designers), it stands as a landmark both in 
heuristic programming and also in the development of modem automatic 
programming. 

The problem domain here is that of discovering proofs in the Russell- 
Whitehead system for the propositional calculus. That system is given as a 
set of (five) axioms and (three) rules of inference; the latter specify how 
certain transformations can be applied to produce new theorems from old 
theorems and axioms. 

The LT program is centered around the idea of “working backward” to 
find a proof. Given a theorem T to be proved, LT searches among the 
axioms and previously established theorems for one from which T can be 
deduced by a single application of one of three simple “Methods” (which 
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embody the given rules of inference). If one is found, the problem is 
solved. Or the search might fail completely. But finally, the search may 
yield one or more “problems” which are usually propositions from which 
T may be deduced directly. If one of these can, in turn, be proved a 
theorem the main problem will be solved. (The situation is actually slightly 
more complex.) Each such subproblem is adjoined to the “subproblem 
list” (after a limited preliminary attempt) and LT works around to it later. 
The full power of LT, such as it is, can be applied to each subproblem, 
for LT can use itself as a subroutine in a recursive fashion. 

The heuristic technique of working backward yields something of a 
teleological process, and LT is a forerunner of more complex systems 
which construct hierarchies of goals and subgoals. Even so, the basic ad- 
ministrative structure of the program is no more than a nested set of 
searches through lists in memory. We shall first outline this structure and 
then mention a few heuristics that were used in attempts to improve 
performance. 

1. Take the next problem from problem list. 

(If there are no more problems, EXIT with total failure.) 

2. Choose the next of the three basic Methods. 

(If no more methods, go to 1.) 

3. Choose the next member of the list of axioms and previous theorems. 

(If no more, go to 2.) ... ,, 

Then apply the Method to the problem, using the chosen theorem 

or axiom. 

If problem is solved, EXIT with complete proof. 

If no result, go to 3. 

If new subproblem arises, go to 4. 

4. Try the special (substitution) Method on the subproblem. 

If problem is solved, EXIT with complete proof. _ 

If no result, put the subproblem at the end of the problem list and 

go to 3. 

Among the heuristics that were studied were (1) a similarity test to 
reduce the work in step 4 (which includes another search through the 
theorem list), (2) a simplicity test to select apparently easier problems 
from the problem list, and (3) a strong nonprovability test to remove from 
the problem list expressions which are probably false and hence not prov- 
able. In a series of experiments “learning” was used to find which earlier 
theorems had been most useful and should be given priority in step 3. 
We cannot review the effects of these changes in detail. Of interest was the 
balance between the extra cost for administration of certain heuristics an 
the resultant search reduction; this balance was quite delicate in some 
cases when computer memory became saturated. The system seemed to be 
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quite sensitive to the training sequence-the order in which problems 
were given. And some heuristics which gave no significant over-all im- 
provement did nevertheless affect the class of solvable problems. Curiouslv 
enough the general efficiency of LT was not greatly improved by any or 
all of these devices. But all this practical experience is reflected in the de- 

Se^ IVD2 mUCh m ° re S ° phiSticated “ GPS ” s y stem Ascribed briefly in 


Wang (I960) has criticized the LT project on the grounds that there 
exist, as he and others have shown, mechanized proof methods which, for 
the particular run of problems considered, use far less machine effort than 
does LT and which have the advantage that they will ultimately find a 
proof for any provable proposition. (LT does not have this exhaustive 

decision procedure” character and can fail ever to find proofs for some 
theorems.) The authors of “Empirical Explorations of the Logic Theory 
Machine, perhaps unaware of the existence of even moderately efficient 
exhaustive methods, supported their arguments by comparison with a par- 
ticularly inefficient exhaustive procedure. Nevertheless, I feel that some of 
Wangs criticisms are misdirected. He does not seem to recognize that the 
authors of LT are not so much interested in proving these theorems as 
they are m the general problem of solving difficult problems. The com- 
binatorial system of Russell and Whitehead (with which LT deals) is far 
less simple and elegant than the system used by Wang. 21 [Note, e.g., the 
emphasis in Newell, Shaw and Simon (1958a, 19585).] Wang’s problems 
white logically equivalent, are formally much simpler. His methods do not 
include any facilities for using previous results (hence they arc sure to 
egrade rapidly at a certain level of problem complexity), white LT is 
fundamentally oriented around this problem. Finally, because of the very 
e ectiveness of Wang’s method on the particular set of theorems in ques- 
tion, he simply did not have to face the fundamental heuristic problem of 
when to decide to give up on a line of attack. Thus the formidable per- 
formance of his program (1960) perhaps diverted his attention from 
heuristic problems that must again spring up when real mathematics is 
ultimately encountered. 

This is not meant as a rejection of the importance of Wang’s work and 
discussion. He and others working on “mechanical mathematics”-have dis- 
covered that there are proof procedures which are much more efficient 
than has been suspected. Such work will unquestionably help in construct- 
ing intelligent machines, and these procedures will certainly be preferred 
when available, to “unreliable heuristic methods.” Wang, Davis and 


wangs procedure (1960a), too, works backward, and can be regarded rs a 

fn T e ‘ h0d of Jaldfication” for deciding iruih-funcional lautoteey. 

o ! h i i , “ S U T ed SCqUCl ’ hc introduccs ™re powerful methods 
(for much more difficult problems). 
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Putnam, and several others are now pushing these new techniques into 
the far more challenging domain of theorem proving in the predicate cal- 
culus (for which exhaustive decision procedures are no longer available). 
We have no space to discuss this area, 25 but it seems clear that a program 
to solve real mathematical problems will have to combine the mathemati- 
cal sophistication of Wang with the heuristic sophistication of Newell, 

Shaw and Simon. 5 ® 

B. Heuristics for Subproblem Selection 

In designing a problem-solving system, the programmer often comes 
equipped with a set of more or less distinct “Methods”— his real task is to 
find an efficient way for the program to decide where and when the differ- 
ent methods are to be used. 

Methods which do not dispose of a problem may still transform it to 
create new problems or subproblems. Hence, during the course of so vmg 
one problem we may become involved with a large assembly of mterrelated 
subproblems. A “parallel” computer, yet to be conceived, might work on 
many at a time. But even the parallel machine must have procedures to 
allocate its resources because it cannot simultaneously app y a its me 
ods to all the problems. We shall divide this administrative problem into 
two parts: the selection of those subproblem(s) which seem most critical 
attractive, or otherwise immediate, and, in the next section, the choice of 

which method to apply to the selected problem. . 

In the basic program for LT (Sec. IVA), subproblem selection is very 
simple. New problems are examined briefly and (if not solved at once) 
are placed at the end of the (linear) problem list. The mam program 
proceeds along this list (step 1), attacking the problems m the order of 
their generation. More powerful systems will have to be more judicious 
(both in generation and selection of problems) for only thus can excessive 
branching be restrained. 51 In more complex systems we can expect to 
consider for each subproblem, at least these two aspects: (1) lts a PP are £ 
“centrality”— how will its solution promote the mam goal, and U) 
apparent “difficulty”— how much effort is it liable to consume We need 
heuristic methods to estimate each of. these quantities and, furth , 

•See Davis and Putnam (1960), and Wang (I960*). 

-All these efforts are directed toward the reduction of search effort. In that^sense 
thev are all heuristic programs. Since practically no one still uses heuristic m a 
TpplJ » 'SgoriLic.- serious workers migh. do well » 
argument on this score. The real problem is to find methods which signific y y 
the anoarentiv inevitable exponential growth of search trees. 

^ "X that the simple scheme of LT has the property that each generated problem 

will eventually get attention, even if several are created map. 

full attention to each problem, as generated, one might never return to alternate 

branches. 
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select accordingly one of the problems and allocate to it some reasonable 
quantity of effort. 33 Little enough is known about these matters, and so it 

is not entirely for lack of space that the following remarks are somewhat 
cryptic. 

. Imagine that the problems and their relations are arranged to form some 
kind of directed-graph structure (Minsky, 19566: Newell and Simon 

19 « 66 ,. : if IenJter and Rochester ’ 1958). The main problem is to establish 
a valid path, between two initially distinguished nodes. Generation of 
new problems is represented by the addition of new, not-yet-valid paths 
or by the insertion of new nodes in old p-ths. Then problems are repre- 
sented by not-yet-valid paths, and “centrality” by location in the structure. 
Associate with each connection, quantities describing its current validity 
state (solved, plausible, doubtful, etc.) and its current estimated difficulty. 

1. GLOBAL METHODS 


The most general problem-selection methods are “global” — at each step 
they look over the entire structure. There is one such simple scheme 
which works well on at least one rather degenerate interpretation of our 
problem graph. This is based on an electrical analogy suggested to us by 

^ o m ^ hin l de J iSnCd . by Sh3nn0n [related t0 one described in Shannon 
(1955) which describes quite a variety of interesting game-playing and 
earning machines] to play a variant of the game marketed as “Hex” (and 
known among mathematicians as “Nash”). The initial board position can 
be represented as a certain network of resistors. (See Fig. 11.) One play- 
er’s goal is to construct a short-circuit path between two given boundaries; 
the opponent tries to open the circuit between them. Each move consists 
of shorting (or opening), irreversibly, one of the- remaining resistors, 
shannon s machine applies a potential between the boundaries and selects 
that resistor which carries the largest current. Very roughly speaking, this 
resistor is likely to be most critical because changing it will have the largest 
effect on the resistance of the net and, hence, in the goal direction of 
s orting (or opening) the circuit. And although this argument is not per- 
ect, nor is this a perfect model of the real combinatorial situation the 
machine does play extremely well. (It can make unsound moves in certain 

artificial situations, but no one seems to have been able to force this during 
a game.) 5 

The use of such a global method for problem selection requires that 
the available “difficulty estimates” for related subproblems be arranged to 

si ;°r WiU Wan - Se V f the considered P robler " * 'he same as one already con- 
. , * or y ery similar. See the discussion in Gelernter and Rochester (1958) This 

l(TZir B u handl f d m0re senera,ly by sim P | - v remembering the (Characters 
c l bv ^hL T' bCen and checki "* ones against .his memory, 

match * th ^ S ° f Mooers (1 956), looking more closely if there seems to be a 
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Figure 11. This board game (due to C. E. Shannon) is played on a network 
of equal resistors. The first player’s goal is to open the circuit between the end 
points; the second player’s goal is to short the circuit. A move consuls of opening 
or shortening a resistor. If the first player begins by opening resistor 1 the second 
player might counter by shorting resistor 4, following the strategy described in 
the text. The remaining move pairs (if both players use that strategy) would be 
(5 8) (9 13) (12, 10 or 2) (2 or 10 win). In this game the nrst player should 
be’ able to force a win, and the maximum-current strategy seems, always to do so. 
even on larger networks. 

combine in roughly the manner of resistance values. Also, we could re- 
gard this machine as using an “analog model” for “planning.” (See Sec. 
IVD.) 29 

2. LOCAL, AND “HEREDITARY,” METHODS 
The prospect of having to study at each step the whole problem structure 
is discouraging, especially since the structure usually changes only slightly 
after each attempt. One naturally looks for methods which merely update 
or modify a small fragment of the stored record. Between the extremes of 
the “first-come-first-served” problem-list method and the full global-survey 
methods, lie a variety of compromise techniques. Perhaps the most attrac- 
tive of these are what we will call the Inheritance methods— essenttally 

recursive devices. . , 

In an Inheritance method, the effort assigned to a subproblem is deter- 
mined only by its immediate ancestry; at the time each problem is created 
it is assigned a certain total quantity Q of time or effort. When a problem 
is later split into subproblems, such quantities are assigned to them by 
some local process which depends only on their relative merits and on what 
remains of Q. Thus the centrality problem is managed implicitly. Such 
schemes are quite easy to program, especially with the new programming 
systems such as IPL (Newell and Tonge, 1960c) and LISP (McCarthy. 
1960) (which are themselves based oh certain hereditary or recursive op- 
erations). Special cases of the Inheritance method arise when one can get 
along with a simple all-or-none Q, e.g., a “stop condition’ —this yields the 

» A variety of combinatorial methods will be matched against the network-analogy 
opponent in a program being completed by R. Silver, Lincoln Laboratory, MIT. 

Lexington, Mass. 
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exploratory method called “backtracking” by Golumb (1961). The decod- 
ing procedure of Wozencraft (1961) is another important variety of In- 
heritance method. 

In the complex exploration process proposed for chess by Newell, Shaw, 
and Simon (19586) we have a form of Inheritance method with a non- 
numerical stop condition. Here, the subproblems inherit sets of goals to be 
achieved. This teleological control has to be administered by an additional 
goal-selection system and is further complicated by a global (but reason- 
ably simple) stop rule of the back-up variety (Sec. IIIC). (Note: we are 
identifying here the move-tree-limitation problem with that of problem 
selection.) Even though extensive experimental results are not yet avail- 
able, we feel that the scheme of Newell, Shaw, and Simon (19586) de- 
serves careful study by anyone planning serious work in this area. It 
shows only the beginning of the complexity sure to come in our develop- 
ment of intelligent machines. 30 

C. “Character-Method” Machines 

Once a problem is selected, we must decide which method to try first. This 
depends on our ability to classify or characterize problems. We first com- 
pute the Character of our problem (by using some pattern recognition 
technique) and then consult a “Character-Method” table or other device 
which is supposed to tell us which method (s) are most effective on prob- 
lems of that Character. This information might be built up from experi- 
ence, given initially by the programmer, deduced from “advice” (Mc- 
Carthy, 1959), or obtained as the solution to some other problem, as 
suggested in the GPS proposal (Newell, Shaw and Simon, 1959a). In any 
case, this part of the machine’s behavior, regarded from the outside, can 
be treated as a sort of stimulus-response, or “table look-up,” activity. 

If the Characters (or descriptions) have too wide a variety of values, 
there will be a serious problem of filling a Character-Method table. One 
might then have to reduce the detail of information, e.g., by using only a 
few important properties. Thus the Differences of GPS (see Sec. IVD2) 
describe no more than is necessary to define a single goal, and a priority 
scheme selects just one of these to characterize the situation. Gelemter 
and Rochester (1958) suggest using a property-weishting scheme, a spe- 
cial case of the “Bayes net” described in Sec. IIG. 

D. Planning 

Ordinarily one can solve a complicated problem only by dividing it into a 
number of parts, each of which can be attacked by a smaller search (or 
be further divided). Generally speaking, a successful division will reduce 

Some further discussion of this question may be found in Slagle (1961), 
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the search time not by a mere fraction, but by a fractional exponent. In 
a graph with 10 branches descending from each node, a 20-step search 
might involve 10 30 trials, which is out of the question, while the insertion 
of just four lemmas or sequential subgoals might reduce the search to onl\ 

5 X 10 4 trials, which is within reason for machine exploration. Thus it wilt 
be worth a relatively enormous effort to find such islands in the solution 
of complex problems. 31 Note that even if one encountered, say, 10° fail- 
ures of such procedures before success, one would still have gained a fac- 
tor of perhaps 10 10 in over-all trial reduction! Thus practically any ability 
at all to “plan" or “analyze," a problem will be profitable, if the problem 
is difficult. It is safe to say that all simple, unitary, notions of how to 
build an intelligent machine will fail, rather sharply, for some modest lev^l 
of problem difficulty. Only schemes which actively pursue an analysis 
toward obtaining a set of sequential goals can be expected to extend 
smoothly into increasingly complex problem domains. 

Perhaps the most straightforward concept of planning is that of using a 
simplified model of the problem situation. Suppose that there is available, 
for a given problem, some other problem of “essentially the same char- 
acter” but with less detail and complexity. Then we could proceed first 
to solve the simpler problem. Suppose, also, that this is done using a sec- 
ond set of methods, which are also simpler, but in some correspondence 
with those for the original. The solution to the simpler problem can then 
be used as a “plan" for the harder one. Perhaps each step will have to be 
expanded in detail. But the multiple searches will add, not multiply, in the 
total search time. The situation would be ideal if the model were, mathe- 
matically, a homomorphism of the original. But even without such per- 
fection the model solution should be a valuable guide. In mathematics 
one’s proof procedures usually run along these lines: one first assumes, 
e.g., that integrals and limits always converge, in the planning stage. Once 
the outline is completed, in this simpleminded model of mathematics, then 
one goes back to try to “make rigorous” the steps of the proof, i.e., to 
replace them by chains of argument using genuine rules of inference. And 
even if the plan fails, it may be possible to patch it by replacing just a few 
of its steps. 

Another aid to planning is the semantic, as opposed to the homomor- 
phic, model (Minsky, 1956a, 1959a). Here we may have an interpretation 
of the current problem within another system, not necessarily simpler, but 
with which we are more familiar and have already more powerful methods. 
Thus, in connection with a plan for the proof of a theorem, we will want 
to know whether the proposed lemmas, or islands in the proof, are —* 
tually true; if not, the plan will surely fail. We can often easily tell if a 
proposition is true by looking at an interpretation. Thus the truth ol a 

n See see. 10 of Ashby (1956). 
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proposition from plane geometry can be supposed, at least with great re- 
liability, by actual measurement of a few constructed drawings (or the 
analytic geometry equivalent). The geometry machine of Gelernter and 
Rochester (1958, 1959) uses such a semantic model with excellent re- 
sults; it follows closely the lines proposed in Minsky (1956a). 

1. THE “CHARACTER-ALGEBRA” MODEL 

Planning with the aid of a model is of the greatest value in reducing search. 
Can we construct machines which find their own models? I believe the 
following will provide a general, straightforward way to construct certain 
kinds of useful, abstract models. The critical requirement is that we be 
able to compile a “Character-Method Matrix” (in addition to the simple 
Character-Method table in Sec. IVC). The CM matrix is an array of en- 
tries which predict with some reliability what will happen when methods 
are applied to problems. Both of the matrix dimensions are indexed by 
problem Characters, if there is a method which usually transforms prob- 
lems of character C, into problems of character Cj then let the matrix 
entry C;,- be the name of that method (or a list of such methods). If 
there is no such method the corresponding entry is null. 

Now suppose that there is no entry for C,/ — meaning that we have no 
direct way to transform a problem of type C ; into one of type C h Multiply 
the matrix by itself. If the new matrix has a non-null (/,/) entry then there 
must be a sequence of two methods which effects the desired transforma- 
tion. If that fails, we may try higher powers. Note that [if we put unity 
for the (/,/) terms] we can reach the 2" matrix power with just n multipli- 
cations. We don’t need to define the symbolic multiplication operation; 
one may instead use arithmetic entries — putting unity for any non-null 
entry and zero for any null entry in the original matrix. This yields a sim- 
ple connection, or flow diagram, matrix, and its nth power tells us some- 
thing about its set of paths of length 2 n . 3 - [Once a non-null entry is dis- 
covered, there exist efficient ways to find the corresponding sequences of 
methods. The problem is really just that of finding paths through a maze, 
and the method of Moore (1959) would be quite efficient. Almost any 
problem can be converted into a problem of finding a chain between two 
terminal expressions in some formal system.] If the Characters are taken 
to be abstract representations of the problem expressions, this “Character- 
Algebra model can be as abstract as are the available pattern-recognition 
facilities. See Minsky (1956a, 1959a). 

The critical problem in using the Character-Algebra model for plan- 
ning is, of course, the prediction reliability of the matrix entries. One can- 
not expect the Character of a result to be strictly determined by the Char- 
acter of the original and the method used. And the reliability of the pre- 
“See, e.g„ Hohn, Seshu, and Aufenkamp ( 1957 ). 
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dictions will, in any case, deteriorate rapidly as the matrix power is raised. 
But, as we have noted, any plan at all is so much better than none that 
the system should do very much better than exhaustive search, even with 
quite poor prediction quality. 

This matrix formulation is obviously only a special case of the char- 
acter planning idea. More generally, one will have descriptions, rather than 
fixed characters, and one must then have more general methods to calcu- 
late from a description what is likely to happen when a method is applied. 

2. CHARACTERS AND DIFFERENCES 

In the GPS (General Problem Solver) proposal of Newell, Shaw, and 
Simon (1959a, 1960a) we find a slightly different framework: they use a 
notion of Difference between two problems (or expressions) where we 
speak of the Character of a single problem. These views are equivalent 
if we take our problems to be links or connections between expressions. 
But this notion of Difference (as the Character of a pair) does lend itself 
more smoothly to teleological reasoning. For what is the goal defined by 
a problem but to reduce the “difference” between the present state and the 
desired state? The underlying structure of GPS is precisely what we have 
called a “Character-Method Machine” in which each kind of Difference 
is associated in a table with ofie or more methods which are known to 
“reduce” that Difference. Since the characterization here depends always 
on (1) the current problem expression and (2) the desired end result, 
it is reasonable to think, as its authors suggest, of GPS as using “means- 
end” analysis. 

To illustrate the use of Differences, we shall review an example (Newell, 
Shaw, and Simon, 1960a). The problem, in elementary propositional cal- 
culus, is to prove that from S A ( — P D Q) we can deduce ( Q VP) AS. 
The program looks at both of these expressions with a recursive matching 
process which branches out from the main connectives. The first Differ- 
ence it encounters is that S occurs on different sides of the main connective 
“A.” It therefore looks in the Difference-Method table under the heading 
“change position.” It discovers there a method which uses the theorem 
(A A B) = (B A A) which is obviously useful for removing, or “reduc- 
ing,” differences of position. GPS applies this method, obtaining 

( P D Q) A S. GPS now asks what is the Difference between this new 

expression and the goal. This time the matching procedure gets down into 
the connectives inside the left-hand members and finds a Difference be- 
tween the connectives “D” and “V.” It now looks in the CM table under 
the heading “Change Connective” and discovers the appropriate method 
using ( — A D B) == (A V B). It applies this method, obtaining 
(P v Q) A 5. In the final cycle, the difference-evaluating procedure discov- 
ers the need for a “change position” inside the left member, and applies a 
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method using (A V B) == (B V A). This completes the solution of the 
problem. 33 

Evidently, the success of this “means-end” analysis in reducing general 
search will depend on the degree of specificity that can be written into the 
Difference-Method table — basically the same requirement for an effective 
Character-Algebra. 

It may be possible to plan using Differences, as well. One might imag- 
ine a ‘ Difference-Algebra” in which - the predictions have the form D = 
D' D". One must construct accordingly a difference-factorization algebra 
for discovering longer chains D = D x • • • D n and corresponding method 
plans. We should note that one cannot expect to use such planning meth- 
ods with such primitive Differences as are discussed in Newell, Shaw, and 
Simon (1960a); for these cannot form an adequate Difference-Algebra (or 
Character-Algebra). Unless, the characterizing expressions have many 
levels of descriptive detail, the matrix powers will too swiftly become de- 
generate. This degeneracy will ultimately limit the capacity of any formal 
planning device. 

One may think of the general planning heuristic as embodied in a re- 
cursive process of the following form. Suppose we have a problem P: 

1. Form a plan for problem P. 

2. Select first (next) step of the plan. 

(If no more steps, exit with “success.”) 

3. Try the suggested method(s): 

Success: return to (b), i.e., try next step in the plan. 

Failure: return to ( a ), i.e., form new plan, or perhaps change cur- 
rent plan to avoid this step. 

Problem judged too difficult: Apply this entire procedure to the 
problem of the current step. 

Observe that such a program schema is essentially recursive; it uses itself 
as a subroutine (explicitly, in the last step) in such a way that its current 
state has to be stored, and restored when it returns control to itself.* 14 

* Compare this with the “matching” process described in Newell and Simon 
(1956). The notions of “Character,” “Character-Algebra” etc., originate in Minsky 
(1956) but seem useful in describing parts of the “GPS” system of Newell and Simon 
(1956) and Newell, Shaw, and Simon (1960<a). The latter contains much additional 
material we cannot survey here. Essentially, GPS is to be sdf-applied to the problem 
of discovering sets of Differences appropriate for given problem areas. This notion of 
“bootstrapping” — applying a problem-solving system to the task of improving seme 
of its own methods — is old and familiar, but in Newell, Shaw, and Simon (1 960:?) 
we find perhaps the first specific proposal about how such an advance micht be 
realized. 

This violates, for example, the restrictions on “DO loops” in programming sys- 
tems such as FORTRAN. Convenient techniques for programming such processes 
were developed by Newell, Shaw and Simon (1960£); the program state variables 
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Miller, Galanter and Pribram 35 discuss possible analogies between hu- 
man problem-solving and some heuristic planning schemes. It seems cer- 
tain that, for at least a few years, there will be a close association be- 
tween theories of human behavior and attempts to increase the intellectual 
capacities of machines. But, in the long run, we must be prepared to dis- 
cover profitable lines of heuristic programming which do not deliberately 
imitate human characteristics . 36 

V. Induction and Models 
A. Intelligence 

In all of this discussion we have not come to grips with anything we can 
isolate as “intelligence.” We have discussed only heuristics, shortcuts, and 
classification techniques. Is there something missing? I am confident that 
sooner or later we will be able to assemble programs of great problem- 
solving ability from complex combinations of heuristic devices- — multiple 
optimizers, pattern- recognition tricks, planning algebras, recursive ad- 
ministration procedures, and the like. In no one of these will we find the 


are stored in “pushdown lists” and both the program and the data are stored in the 
form of “list structures.” Gelernter (1959) extended FORTRAN to manage some 
of this. McCarthy has extended these notions in LISP (1960) to permit explicit 
recursive definitions of programs in a language based on recursive functions ot* 
symbolic expressions; here the management of program state variables is fully 
automatic. See also Orchard-Hays (1960). 

“See chaps. 12 and 13 of Miller, Galanter, and Pribram (1960). 

“Limitations of space preclude detailed discussion here of theories of self-organiz- 
ing neural nets, and other models based on brain analogies. [Several of these are 
described or cited in Proceedings of a Symposium on Mechanisation of Thought 
Processes , London: H. M. Stationery Office, 1959, and Self Organizing Systems , 
M. T. Yovitts and S. Cameron (eds.). New York: Pergamon Press, I960.] This 
omission is not too serious, I feel, in connection with the subject of heuristic pro- 
gramming, because the motivation and methods of the two areas seem so different. 
Up to the present time, at least, research on neural-net models has been concerned 
mainly with the attempt to show that certain rather simple heuristic processes, e.g.. 
reinforcement learning, or property-list pattern recognition, can be realized or evolved 
by collections of simple elements without very highly organized interconnections. 
Work on heuristic programming is characterized quite differently by the search for 
new, more powerful heuristics for solving very complex problems, and by very little 
concern for what hardware (neuronal or otherwise) would minimally suffice for its 
realization. In short, the work on “nets” is concerned with how far one can get with 
a small initial endowment; the work on “artificial intelligence” is concerned v. tin 
using all we know to build the most powerful systems that we can. It is my expecta- 
tion that, in problem-solving power, the (allegedly brainlike) minimal-structure 
systems will never threaten to compete with their more deliberately designed contem- 
poraries; nevertheless, their study should prove profitable in the development ot 
component elements and subsystems to be used in the construction of the more 
systematically conceived machines. 
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seat of intelligence. Should we ask what intelligence “really is”? My own 
view is that this is more of an aesthetic question, or one of sense of dignity, 
than a technical matter! To me “intelligence” seems to denote little more 
than the complex of performances which we happen to respect, but do 
not understand. So it is, usually, with the question of “depth” in mathe- 
matics. Once the proof of a theorem is really understood its content seems 
to become trivial. (Still, there may remain a sense of wonder about how 
the proof was discovered.) 

Programmers, too, know that there is never any “heart” in a program. 
There are high-level routines in each program, but all they do is dictate 
that “if such and such, then transfer to such and such a subroutine.” And 
when we look at the low-level subroutines, which “actually do the work,” 
we find senseless loops and sequences of trivial operations, merely carry- 
ing out the dictates of their superiors. The intelligence in such a system 
seems to be as intangible as becomes the meaning of a single common 
word when it is thoughtfully pronounced over and over again. 

But we should not let our inability to discern a locus of intelligence lead 
us to conclude that programmed computers therefore cannot think. For it 
may be so with man, as with machine, that, when we understand finally 
the structure and program, the feeling of mystery (and self-approbation) 
will weaken. 37 We find similar views concerning “creativity” in Newell, 
Shaw, and Simon (1958c). The view expressed by Rosenbloom (1951) 
that minds (or brains) can transcend machines is based, apparently, on an 
erroneous interpretation of the meaning of the “unsolvability theorems” of 
Godel. 38 

"See Minsky (1956, 1959). 

"On problems of volition we are in general agreement with McCulloch (1954) 
that our freedom of will “presumably means no more than that we can distinguish 
between what we intend (/.*., our plan), and some intervention in our action.” See 
also MacKay (1959) and [the] references; we are, however, unconvinced by his 
eulogization of “analog” devices. Concerning the “mind-brain” problem, one should 
consider the arguments of Craik (1952), Hayek (1952), and Pask (1959). Among 
the active leaders in modern heuristic programming, perhaps only Samuel (19606) 
has taken a strong position against the idea of machines thinking. His argument, 
based on the fact that reliable computers do only that which they are instructed to 
do, has a basic flaw; it does not follow that the programmer therefore has full 
knowledge (and therefore full responsibility and credit for) what will ensue. For 
certainly the programmer may set up an evolutionary system whose limitations are 
for him unclear and possibly incomprehensible. No better does the mathematician 
know all the consequences of a proposed set of axioms. Surely a machine has to be 
in order to perform. But we cannot assign all the credit to its programmer if the 
operation of a system comes to reveal structures not recognizable or anticipated 
by the programmer. While we have not yet seen much in the way of intelligent 
activity in machines, Samuel’s arguments (circular in that they are based on the 
presumption that machines do not have minds) do not assure us against this. Turing 
(1956) gives a very knowledgeable discussion of such matters. 
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B. Inductive Inference 

Let us pose now for our machines, a variety of problems more challenging 
than any ordinary game or mathematical puzzle. Suppose that we want 
a machine which, when embedded for a time in a complex environment or 
“universe,” will essay to produce a description of that world — to dis- 
cover its regularities or laws of nature. We might ask it to predict what 
will happen next. We might ask it to predict what would be the likely 
consequences of a certain action or experiment. Or we might ask it to 
formulate the laws governing some class of events. In any case, our task 
is to equip our machine with inductive ability — with methods which it can 
use to construct general statements about events beyond its recorded ex- 
perience. Now, there can be no system :or inductive inference that will 
work well in all possible universes. But given a universe, or an ensemble 
of universes, and a criterion of success, this (epistemological) problem 
for machines becomes technical rather than philosophical. There is quite 
a literature concerning this subject, but we shall discuss only one ap- 
proach which currently seems to us the most promising; this is what 
we might call the “grammatical induction” schemes of Solomonoff (1957. 
1958, 1959a), based partly on work of Chomsky and Miller (19576, 
1958). 

We will take language to mean the set of expressions formed from some 
given set of primitive symbols or expressions, by the repeated application 
of some given set of rules; the primitive expressions plus the rules is 
the grammar of the language. Most induction problems can be framed 
as problems in the discovery of grammars. Suppose, for instance, that a 
machine’s prior experience is summarized by a large collection of state- 
ments, some labelled “good” and some “bad” by some critical device. How 
could we generate selectively more good statements? The trick is to find 
some relatively simple (formal) language in which the good statements 
are grammatical, and in which the bad ones are not. Given such a language, 
we can use it to generate more statements, and presumably these will tend 
to be. more like the good ones. The heuristic argument is that if wc can 
find a relatively simple way to separate the two sets, the discovered rule 
is likely to be useful beyond the immediate experience. If the extension 
fails to be consistent with new data, one might be able to make small 
changes in the rules and, generally, one may be able to use many or- 
dinary problem-solving methods for this task. 

The problem of finding an efficient grammar is much the same as that 
of finding efficient encodings , or programs, for machines; in each case, one 
needs to discover the important regularities in the data, and exploit the 
regularities by making shrewd abbreviations. The possible importance of 
Solomonofl’s work (1960) is that, despite some obvious defects, it may 
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point the way toward systematic mathematical ways to explore this dis- 
covery problem. He considers the class of all programs (for a given gen- 
eral-purpose computer) which will produce a certain given output (the 
body of data in question). Most such programs, if allowed to continue, 
will add to that body of data. By properly weighting these programs, per- 
haps by length, we can obtain corresponding weights for the different 
possible continuations, and thus a basis for prediction. If this prediction 
is to be of any interest, it will be necessary to show some independence 
of the given computer; it is not yet clear precisely what form such a result 
will take. 

C. Models of Oneself 

If a creature can answer a question about a hypothetical experiment, 
without actually performing that experiment, then the answer must have 
been obtained from some submachine inside the creature. The output of 
that submachine (representing a correct answer) as well as the input (rep- 
resenting the question) must be coded descriptions of the correspondinu 
external events or event classes. Seen through this pair of encoding and 
decoding channels, the internal submachine acts like the environment, and 
so it has the character of a “model.” The inductive inference problem may 
then be regarded as the problem of constructing such a model. 

To the extent that the creature’s actions affect the environment, this 
internal model of the world will need to include some representation of the 
creature itself. If one asks the creature “why did you decide to do such 
and such” (or if it asks this of itself), any answer must come from the 
internal model. Thus the evidence of introspection itself is liable to be based 
ultimately on the processes used in constructing one’s image of one’s 
self. Speculation on the form of such a model leads to the amusing pre- 
diction that intelligent machines may be reluctant to believe that they are 
just machines. The argument is this: our own self-models have a substan- 
tially “dual” character; there is a part concerned with the physical or me- 
chanical environment — with the behavior of inanimate objects — and there 
is a part concerned with social and psychological matters. It is pre- 
cisely because we have not yet developed a satisfactory mechanical 
theory of mental activity that we have to keep these areas apart. We could 
not give up this division even if we wished to — until we find a unified model 
to replace it. Now, when we ask such a creature what sort of being it is, it 
cannot simply answer “directly”; it must inspect its model(s). And it 
must answer by saying that it seems to be a dual thing — which appears 
to have two parts— a “mind” and a “body.” Thus, even the robot, unless 
equipped with a satisfactory theory of artificial intelligence, would have to 
maintain a dualistic opinion on this matter . 39 

“There is a certain problem of infinite regression in the notion of a machine 
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Conclusion 

In attempting to combine a survey of work on “artificial intelligence” 
with a summary of our own views, we could not mention every relevant 
project and publication. Some important omissions are in the area of 
“brain models”; the early work of Farley and Clark (1954) [also Farley’s 
paper in Yovitts and Cameron (I960),, often unknowingly duplicated, 
and the work of Rochester (1956) and Milner (I960)]. The work of 
Lettvin et al. (1959) is related to the theories in Selfridge (1959). We 
did not touch at all on the problems 'of logic and language, and of in- 
formation retrieval, which must be faced when action is to be based on the 
contents of large memories; see, e.g., McCarthy (1959). We have not 
discussed the basic results in mathematical logic which bear on the question 
of what can be done by machines. There are entire literatures we have 
hardly even sampled — the bold pioneering work of Rashevsky (c. 1929) 
and his later co-workers (Rashevsky, 1960); Theories of Learning, 
e.g., Gora (1959); Theory of Games, e.g., Shubik (1960); and Psy- 
chology, e.g., Bruner et al. (1956). And everyone should know the 
work of Polya (1945, 1954) on how to solve problems. We can hope 
only to have transmitted the flavor of some of the more ambitious 
projects directly concerned with getting machines to take over a larger 
portion of problem-solving tasks. 

One last remark: we have discussed here only work concerned with 
more or less self-contained problem-solving programs. But as this is 
written, we are at last beginning to see vigorous activity in the direction of 
constructing usable timesharing or multiprogramming computing systems. 
With these systems, it will at last become economical to match human 
beings in real time with really large machines. This means that we can 
work toward programming what will be, in effect, “thinking aids.” In the 
years to come, we expect that these man-machine systems will share, and 
perhaps for a time be dominant, in our advance toward the development 
of “artificial intelligence.” 


having a good model of itself: of course, the nested models must lose detail and 
finally vanish. But the argument, e.g., of Hayek (see 8.69 and 8.79, 1952) that we 
cannot “fully comprehend the unitary order” (of our own minds) ignores the power 
of recursive description as well as Turing’s demonstration that (with sufficient 
external writing space) a “general-purpose” machine can answer any question about 
a description of itself that any larger machine could answer. 
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We observe that on occasion expressions in some language are put for- 
ward that purport to state “a problem,” In response a method (or algo- 
rithm) is advanced that claims to solve the problem. That is, if input 
data are given that meet all the specifications of the problem statement, 
the method produces another expression in the language that is the 
solution to the problem. If there is a challenge as to whether the method 
actually provides a general solution to the problem (i.e., for all admissible 
inputs), a proof may be forthcoming that it does. If there is a challenge 
to whether the problem statement is well defined, additional formalization 
of the problem statement may occur. In the extreme this can reach t-ck 
to formalization of the language used to state the problem, until a formal 
logical calculus is used. 

We also observe that for other problems that people have and solve 
there seems to be no such formalized statement and formalized method. 
Although usually definite in some respects problems of this type seem 
incurably fuzzy in others. That there should be ill-defined problems 
around is not very surprising. That is, that there should be expressions 
that have some characteristics of problem statements but are fragmentary 
seems not surprising. However, that there should exist systems (i.e., men) 
that can solve these problems without the eventual intervention of formal 
statements and formal methods does pose an issue. Perhaps there are 
two domains of problems, the well structured and the ill structured. 
Formalization always implies the first. Men can deal with both kinds. 
By virtue of their capacity for working with ill-structured problems, 
they can transmute some of these into well-structured (or formalized) 
problems. But the study of formalized problems has nothing to say about 
the domain of ill-structured problems. In particular, there can never be 
a formalization of ill-structured problems, hence never a theory (in a 
strict sense) about them. All that is possible is the conversion of particu- 
lar problems from ill structured to w’ell structured via the one transducer 
that exists, namely, man. 

Perhaps an analog is useful: well -structured problems are to ill- 
structured problems as linear systems are to nonlinear systems, or as 
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stable systems are to unstable systems, or as rational behavior is to non- 
rational behavior. In each case, it is not that the world has been neatly 
divided into two parts, each with a theory proper to it. Rather, one 
member of the pair is a very special case about which much can be said, 
whereas the other member consists of all the rest of the world — uncharted, 
lacking uniform approach, inchoate, and incoherent. 

This is not the only view, of course. Alternatively, one can assert 
that all problems can be formulated in the same way. The formalization 
exists because there is some symbolic system, whether on paper or in the 
head, that holds the specific, quite definite information in the problem 
statement. The fragmentation of problem statement that occurs when an 
attempt is made to explicate the problem only shows that there are seri- 
ous (perhaps even profound) communication problems. But it does not 
say that ill-structured problems are somehow different in nature. 

Thus we have an issue — somewhat ill structured, to be sure — but still 
an issue. What are ill-structured problems and are they a breed apart 
from well-structured ones? This chapter is essentially an essay devoted 
to exploring the issue, as it stands in 1968. 

We have an issue defined. What gives life to it are two concerns, 
one broad, one narrow. At root, there is the long-standing concern over 
the rationalization of human life and action. More sharply stated, this is 
the challenge of art by science. In terms of encounters long resolved, it 
is whether photography will displace painting, or whether biology and 
physiology can contribute to the practice of medicine. In terms of en- 
counters now in twilight, it is whether new products come from applied 
science or from lone inventors. In terms of encounters still active, it is 
whether the holistic diagnostic judgment of the clinical psychologist is 
better than the judgments of a regression equation [12]. For the purpose 
of this essay, of course, it is to what extent management science can 
extend into the domain of business jud'gment. 

When put in this context, the issue has a charge. The concern flows 
partly from economics, where it is now usually labeled the problem of 
automation. Concern also flows from a problem of identity, in which 
some are compelled to ask what attributes man can uniquely call his 
own. As has been pointed out, probably most thoroughly by Ellul [3], 
it makes no sense to separate hardware and software, that is, to separate 
machines from procedures, programs, and formalized rules. They are all 
expressions of the rationalization of life, in which human beings become 
simply the agents or carriers of a universalistic system of orderly re- 
lations of means to ends. 

Thus, viewed broadly, the issue is emotionally toned. However, this 
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fact neither eliminates nor affects the scientific questions involved, al- 
though it provides reasons for attending to them. Our aim in this essay 
is essentially scientific, a fact which leads to the second, more narrow 
context. 

Within management science the nature of rationalization has varied 
somewhat over time. In the early days, those of Frederick Taylor, it was 
expressed in explicit work procedures, but since World War II it has been 
expressed in the development of mathematical models and quantitative 
methods. In 1958 we put it as follows: 

A problem ia well structured to the extent that it satisfies the following criteria: 

1. It can be described in terms of numerical variables, scalar and vector quanti- 
ties. 

2. The goals to be attained can be specified in terms of a well-defined objective 
function — for example, the maximization of profit or the minimization of cost. 

3. There exist computational routines ( algorithms ) that permit the solution to 
be found and stated in actual numerical terms. Common examples of such algorithms, 
which have played an important role in operations research, are maximization 
procedures in the calculus and calculus of variations, linear-programming algorithms 
like the stepping-stone and simplex methods, Monte Carlo techniques, and so on 
B1.PP.4-6L 

Ill-structured problems were defined, as in the introduction of this 
essay, in the negative: all problems that are not well structured. Now 
the point of the 1958 paper, and the reason that it contrasted well- and 
ill-structured problems, was to introduce heuristic programming as rele- 
vant to the issue: 

With recent* developments in our understanding of heuristic processes and their 
simulation by digital computers, the way is open to deal scientifically with ill- 
structured problems— 4o make the computer co-extensive with the human mind 
[21, p. 91. 

That is, before the development of heuristic programming (more gener- 
ally, artificial intelligence) the domain of ill-structured problems had 
been the exclusive preserve of human problem solvers. Now we had 
other systems that also could work on ill-structured problems and that 
could be studied and used. 

This 1958 paper is a convenient marker for the narrow concern of 
the present essay. It can symbolize the radical transformation brought 
by the computer to the larger, almost philosophical concern over the 
nature and possibilities for rationalization. The issue has become almost 
technical, although now it involves three terms, where before it involved 
only two: 
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• What is the nature of problems solved by formal algorithms? 

• What is the nature of problems solved by computers? 

• What is the nature of problems solved by men? 

We have called the first well-structured problems; the last remains the 
residual keeper of ill-structured problems; and the middle term offers the 
opportunity for clarification. 

Our course will be to review the 1958 paper a little more carefully, 
leading to a discussion of the nature of problem solving. From this will 
emerge an hypothesis about the nature of generality in problem solving, 
which will generate a corresponding hypothesis about the nature of ill- 
structured problems. With theses in hand, we first consider some impli- 
cations of the hypotheses, proceed to explore these a little, and finally 
bring out some deficiencies. 

The 1958 paper asserted that computers (more precisely, computers 
appropriately programmed) could deal with ill-structured problems, where 
the latter was defined negatively. The basis of this assertion was two- 
fold. First, there had just come into existence the first successful heu- 
ristic programs, that is to say, programs that performed tasks requiring 
intelligence when performed by human beings. They included a theorem- 
proving program in logic [15], a checker-playing program [19], and a 
pattern recognition program [20], These were tasks for which algorithms 
either did not exist or were so immensely expensive as to preclude their 
use. Thus, there existed some instances of programs successfully solving 
interesting ill-structured problems. The second basis was the connection 
between these programs and the nature of human problem solving .{16]. 
Insofar as these programs reflected the same problem-solving processes 
as human beings used, there was additional reason to believe that the 
programs dealt with ill-structured problems. The data base for the as- 
sertion was fairly small, but there followed in the next few years ad- 
ditional heuristic programs that provided support. There was one that 
proved theorems in plane geometry, one that did symbolic indefinite 
integration, a couple of chess programs, a program for balancing assembly 
lines, and several pattern recognition programs [5]. 

The 1958 paper provided no positive characterization of ill-structured 
problems. Although it could be said that some ill-structured problems 
were being handled, these might constitute a small and particularly “well- 
formed” subset. This was essentially the position taken by Reitman, in 
one of the few existing direct contributions to the question of ill-formed 
problems [17, 18]. He observed, as have others, that all of the heuristic 
programs, although lacking well-specified algorithms, were otherwise quite 
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precisely defined. In particular, the test whereby one determined whether 
the problem was solved was well specified, as was the initial data base 
from which the problem started. Thus, he asserted that all existing heu- 
ristic programs were in a special class by virtue of certain aspects being 
well defined, and thus shed little light on the more general case. 

Stating this another way, it is not enough for a problem to become ill 
structured only with respect to the methods of solution. It is required 
also to become ill structured with respect to both the initial data and 
the criteria for solution. To the complaint that one would not then really 
know what the problem was, the rejoinder is that almost all problems 
dealt with by human beings are ill structured in these respects. To use 
an example discussed by Reitman, in the problem of making a silk purse 
from a sow’s ear, neither “silk purse” nor “sow’s ear” is defined beyond 
cavil. To attempt really to solve such a problem, for instance, would be 
to search for some ways to stretch the implicit definitions to force ac- 
ceptance of the criteria, for example, chemical decomposition and re- 
synthesis. 

Reitman attempted a positive characterization of problems by setting 
out the possible forms of uncertainty in the specification of a problem: 
the ways in which the givens, the sought-for transformation, or the goal 
could be ill defined. This course has the virtue, if successful, of 
defining “ill structured” independently of problem solving and thus 
providing a firm base on which to consider how such problems might 
be tackled. I will not follow him in this approach, however. It seems 
more fruitful here to start with the activity of problem solving. 

1. THE NATURE OF PROBLEM SOLVING 

A rather general diagram, shown in Fig. 10.1, will serve to convey a 
view of problem solving that captures a good deal of what is known, 
both casually and scientifically. A problem solver exists in a task environ- 
ment, some small part of which is the immediate stimulus for evoking 
the problem and which thus serves as the initial problem statement. 1 
This external representation is translated into some internal represen- 
tation (a condition, if you please, for assimilation and acceptance of the 
problem by the problem solver). There is located within the memory 
of the problem solver a collection of methods. A method is some organ- 

‘Its statement form is clear when given linguistically, as in “Where do wc locate 
the new warehouse?” Otherwise, “statement" is to be taken metaphorically as com- 
prising those clues in the environment attended to by the problem solver that indi- 
cate to him the existence of the problem. 


368 


Heuristic Programming: Ill-Structured Problems 


Task Environment 



is not under the control the inputting process. 
Figure 10.1. General schema of a problem solver. 


ized program or plan for behavior that manipulates the internal repre- 
sentation in an attempt to solve the -problem. For the type of problem 
solvers we have in mind— business men, analysts, etc.— there exist many 
relatively independent methods, so that the total behavior of the problem 
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solver is made up as an iterative cycle in which methods are selected 
on the basis of current information (in the internal representation) and 
tried with consequent modification of the internal representation, and a 
new method is selected. 

Let us stay vrtth this view of problem solving- for a while, even though 
it de-emphasizes some important aspects, such as the initial determination 
of an internal representation, its possible change, and the search for or 
construction of new methods (by other methods) in the course of problem 
solving. What Fig. 10.1 does emphasize is the method— the discrete 
package of information that guides behavior in an attempt at problem 
solution. It prompts us to inquire about its anatomy. 

IJ. The Anatomy of a Method 

Let us examine some method in management science. The simplex 
method of linear programming will serve admirably. It is well known, 
important, and undoubtedly a method. It also works on well-structured 
problems, but we will take due account of this later. The basic linear 
programming problem is as follows. 

Given : a set of positive real variables 

Xj ^ 0, j a l f • • • t n 
and real constants 

&\h bit i « ip . • * i wi; j = 1> • • • i n 

maximize 

z = 2 cfc f 
i 

subject to 

2/ ^ ^ If • • • | Vt 

J 

Figure 10.2 shows the standard data organization used in the simplex 
method and gives the procedure to be followed. We have left out the 
procedure for getting an initial solution, that is, we assume that the 
tableau of Fig. 10.2 is initially filled in. Likewise, we have ignored the 
detection of degeneracy and the procedure for handling it. 

There are three parts to the simplex method. The first is the state- 
ment of the problem. This determines the entities you must be able to 
identify in a situation and what you must know about their properties 
and mutual interrelationships. Unless you can find a set of nonnegative 
numerical quantities, subject to linear constraints and having a linear 
objective function to be maximized, you do not have a LP problem, and 
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the method cannot help you. The second part of the method is the 
procedure that delivers the solution to the problem. It makes use only 
of information available in the problem statement. Indeed, these two 
are coupled together as night and day. Any information in the prob- 
lem statement which is known not to enter into the procedure can 
be discarded, thereby making the problem just that much more general. 


Simplex tableau 



Procedure 

1. Let j a = column with min {z,- — c,|z, — c,- < 0} ; 
if no Zj — Cj < 0, then at maximum z, end. 

2. Let to = row with min > 0} ; 

if no bi/Uj, > 0, theh z unbounded, end. 

3. For row t 0 , <- 

4. For row i ^ i 0 , <„■ «- t {i - ^ 

( tu] is the value from step 3). 

Define 

£„+< =* bi — 2 a%jX J} i = 1, . . . , m 

i 

Cn+t* “ 0 

= 1 if i = k } 0 otherwise 
Figure 10.2, Simplex method. 
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The third part of the method is the proof or justification that the pro- 
cedure in fact delivers the solution to the problem (or delivers it within 
certain specified limits). The existence of this justification has several 
consequences. One, already noted, is the complete adaptation of means 
to ends — of the shaping of the problem statement so that it is as general 
as possible with respect to the procedure. Another consequence is a 
toleration of apparent meaninglessness in the procedure. It makes no 
difference that there seems to be neither rhyme nor reason to the steps 
of the method in Fig. 10.2. Careful analysis reveals that they are in 
fact' just those steps necessary to the attainment of the solution. This 
feature is characteristic of mathematical and computational methods 
generally and sometimes is even viewed as a hallmark. 

An additional part of the simplex method is a rationale that can be 
used to make the method understandable. The one usually used for the 
simplex is geometrical, with each constraint being a (potential) boundary 
plane of the space of feasible solutions. Then the simplex procedure is 
akin to c lim bing from vertex to vertex until the maximal one is reached. 
This fourth part is less essential than the other three. 

The first three parts seem to be characteristic of all methods. Cer- 
tainly, examples can be multiplied endlessly. The quadratic formula pro- 
vides another clear one: 

Problem statement: Find x such that ax 2 + bx + c = 0. 

Procedure: compute x = b/2a ± %a Vb 2 — 4ac. 

Justification: (substitute formula in ax^ + bx + c and show 

by algebraic manipulation that 0 results). 

In each case a justification is required (and forthcoming) that estab- 
lishes the relation of method to problem statement. As we move toward 
more empirical methods, the precision of both the problem statement and 
the procedure declines, and concurrently the precision of the justification; 
in fact, justification and plausible rationale merge into one. 

1.2 • Generality and Power 

We need to distinguish the generality of a method from its power. 
A method lays claim via its problem statement to being applicable to a 
certain set of problems, namely, to all those for which the problem state- 
ment applies. The generality of a method is determined by how large 
the set of problems is. Even without a well-defined domain of all prob- 
lems, or any suitable measure on the sets of problems, it is still often 
possible to compare two problem statements and judge one to be more 
inclusive than another, hence one method more genera) than the other. 
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A method that is applicable only to locating warehouses is less general 
than one that is applicable to problems involving the location of all physi- 
cal resources. But nothing interesting can be said about the relative 
generality of a specific method for inventory decisions versus one for 
production scheduling. 

Within the claimed domain of a method we can inquire after its ability 
to deliver solutions: the higher this is, the more powerful the method. 
At least three somewhat independent dimensions exist along which this 
ability can be measured. First, the method may or may not solve every 
problem in the domain; and we may loosely summarize this by talking of 
the probability of solution. Second, there may exist a dimension of quality 
in the solution, such as how close an optimizing method gets to the peak. 
Then methods can differ on the quality of their solutions. (To obtain a 
simple characteristic for this requires some summarization over the ap- 
plicable domain, but this feature need not concern us here.) Third, the 
method may be able to use varying amounts of resources. Then, judg- 
ments of probability of solution and of quality are relative to the amount 
of resources used. Usually the resource will be time, but it can also be 
amount of computation, amount of memory space, number of dollars to 
acquire information, and so on. For example, most iterative methods for 
solving systems of equations do not terminate in a finite number of 
iterations, but produce better solutions if run longer; the rate of con- 
vergence becomes a significant aspect of the power of such methods. 

In these terms the simplex method would seem to rank as one of 
limited generality but high power. The restrictions to linearity, both in 
the constraints and the objective function, and to a situation describable 
by a set of real numbers, all constrict generality. But the simplex method 
is an algorithm within its domain and guarantees delivery of the complete 
solution. It is not the least general method, as is indicated by the trans- 
portation problem with its more specialized assumptions; nor is it the 
most powerful method for its domain, since it can be augmented with 
additional schemes that obtain solutions more expeditiously. 

Evidently there is an inverse relationship between the generality of 
a method and its power. Each added condition in the problem statement 
is one more item that can be exploited in finding the solution, hence in 
increasing the power. If one takes a method, such as the simplex method, 
and generalizes the problem statement, the procedure no longer solves 
every problem in the wider domain, but only a subset of these. Thus 
the power diminishes. The relationship is not one-one, but more like a 
limiting relationship in which the amount of information in the problem 
statement puts bounds on how powerful the method can be. This re- 
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Figure 10.3. Generality versus power. 


lationship is important enough to the argument of this essay that we 
indicate it symbolically in Fig. 10.3. The abcissa represents increasing 
information in the problem statement, that is, decreasing generality. The 
ordinate represents increasing power. For each degree of generality an 
upper bound exists on the possible power of a method, though there are 
clearly numerous methods which do not fully exploit the information in 
the problem statement. 

2. TWO HYPOTHESES: ON GENERALITY AND 
ON ILL-STRUCTURED PROBLEMS 

With this view of method and problem solver we can move back 
toward the nature of ill-structured problems. However, we need to ad- 
dress one intermediate issue: the nature of a general problem solver. 
The first heuristic programs that were built laid elaims to power, not 
to generality. A chess or checker program was an example of artificial 
intelligence because it solved a problem difficult by human standards; 
there was never a pretense of its being general. Today’s chess programs 
cannot even play checkers, and vice versa. 

Now this narrowness is completely consistent with our general ex- 
perience with computer programs as highly special methods for restricted 
tasks. Consider a typical subroutine library, with its specific routines 
for inverting matrices, computing the sine, carrying out the simplex 
method, and so on. The only general “programs” are the higher-level 


C-13 


374 


Heuristic Programming: Ill-Structured Problems 


programming languages, and these are not problem solvers in the usual 
sense, but only provide means to express particular methods. 2 Thus the 
view has arisen that, although it may be possible to construct an artificial 
intelligence for any highly specific task domain, it will not prove possible 
to provide a general intelligence. In other words, it is the ability to be 
a general problem solver that marks the dividing line between human 
intelligence and machine intelligence. 

The formulation of method and problem solver given earlier leads 
rather directly to a simple hypothesis about this question : 

Generality Hypothesis. A general problem solver is one that has a collection of 
successively weaker methods that demand successively less of the environment in 
order to be applied. Thus a good general problem solver is simply one that has the 
best of the weak methods. 

This hypothesis, although itself general, is not without content. (To put 
it the way that philosophers of science prefer, it is falsifiable.) It says 
that there are no special mechanisms of generality— nothing beyond the 
willingness to carry around specific methods that make very weak de- 
mands on the environment for information. By the relationship expressed 
m Fig. 10.3 magic is unlikely, so that these methods of weak demands 
will also be methods of low power. Having a few of them down at the 
very low tip in the figure gives the problem solver the ability to tackle 
almost any kind of problem, even if only with marginal success. 

There are some ways in which this generality hypothesis is almost 
surely incorrect or at least incomplete, and we will come, to these later; 
but let us remain with the main argument. There is at least one close 
association between generality and ill-structured problems: it is man that 
can cope with both. It is also true that ill-structurcd problems, what- 
ever else may be the case, do not lend themselves to sharp solutions. 
Indeed, their lack of specificity would seem to be instrumental in pro- 
hibiting the use of precisely defined methods. Since every problem does 
present some array of available information— something that could meet 
the conditions of a problem statement of some method— the suspicion 
arises that lack of structure in a problem is simply another indication 
that there are not methods of high power for the particular array of 
information available. Clearly this situation does not prevail absolutely, 
ut only with respect to a given problem solver and his collection of 
methods (or, equally, a population of highly similar problem solvers). 
We can phrase this suspicion in sharper form: 

‘The relationship of programming languages to problem solvers, especially as the 
anguagos become more problem-oriented, is unexplored territory. Although relevant 
to the mam question of this essay, it cannot be investigated further here. 
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lUslrvLclured. Problem Hypothesis. A problem solver finds a problem ill structured 
*£££ er of hL methods^ are applicable to the problem lies below a certain 

threshold. 

The lack of any uniform measure of power, with the consequent lack 
of precision about a threshold on this power, is- not of real concern: the 
notion of ill-structuredness is similarly vague. The hypothesis says that 
the problem of locating a new warehouse will look well structured to a 
firm that has, either by experience, analysis, or purchase, acc ^ ire 
programmed procedure for locating warehouses, providing it has decided 
that the probability of obtaining an answer of suitable quahty is igh 
enough simply to evoke the program m the face of the new locati 
problem. The problem will look ill structured to a firm that has only ita 
general problem-solving abilities to fall back on. It can only have the 
most general faith that these procedures will discover appropriate infor- 
mation and use it in appropriate ways in making the decision. 

My intent is not to argue either of these two hypotheses directly but 
rather to examine some of their implications. First, t e wea me o 
must be describable in more concrete terms. This we will do m some detail, 
since it has been the gradual evolution of such methods in ^ificial mte - 
ligence that suggested the hypotheses in the first place. Second, the picture 
of Fig. 10.3 suggests not only that there are weak methods and strong ones, 
but that there is continuity between them in some sense. Phrase an- 
other way, at some level the methods of artificial intelligence and those 
of operations research should look like members of the same family. \\e 
will also look at this implication, although somewhat more sketclu y, 
since little work has been done in this direction. Third, we can revisit 
human decision makers in ill-structured situations. This we do in an 
even more sketchy manner, since the main thrust of this essay stems 
from the more formal concerns. Finally, after these (essentially positive) 
explications of the hypotheses, we will turn to discussion of some dif- 
ficulties. 

3. THE METHODS OF HEURISTIC PROGRAMMING 

There has been continuous work in artificial intelligence ever since 
the article- quoted at the beginning of this chapter [21] took note of the 
initial efforts. The field has had two main branches. We will concentrate 
on the one called heuristic programming. It is most closely identifie 
with the programmed digital computer and with problem solving. Also, 
almost all the artificial intelligence efforts that touch management sci- 
ence are included within it. The other branch, identified with pattern 
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recognition, self-organizing systems, and learning systems, although not 
exempt from the observations to be made here, is sufficiently different to 
preclude its treatment. 

A rather substantial number of heuristic programs have been con- 
structed or designed and have gone far enough to get into the literature. 
They cover a wide range of tasks: game playing, mostly chess, checkers, 
and bridge; theorem proving, mostly logic, synthetic geometry, and 
various elementary algebraic systems; all kinds of puzzles; a range of 
management science tasks, including line balancing, production sched- 
uling, and warehouse location; question-answering systems that accept 
quasi-English of various degrees of sophistication; and induction prob- 
lems of the kind that appear on intelligence tests. The main line of 
progress has constituted a meandering tour through new task areas which 
seemed to demand new analyses. For example, there is considerable cur- 
rent work on coordinated effector-receptor activity (e.g., hand-eye) in 
the real world— a domain of problems requiring intelligence that has not 
been touched until this time. 

Examination of this collection of programs reveals that only a few 
ideas seem to be involved, despite the diversity of tasks. These ideas, if 
properly expressed, can become a collection of methods' in the sense used 
earlier. Examination of these methods shows them to be extraordinarily 
weak compared with the methods, say, of linear programming. In compen- 
sation, they have a generality that lets them be applied to tasks such as 
discovering proofs of theorems, where strong methods are unknown . 3 

t thus appears that the work in heuristic programming may provide 
a first formalization of the kind of weak methods called for by our two 
hypotheses. (To be sure, as already noted, psychological invention runs 
the other way: the discoveiy that there seems to be a small set of methods 

un er ying the diversity of heuristic programs suggested the two hy- 
potheses.) J 

It might be claimed that the small set of methods shows, not parsi- 
mony, but the primitive state of development of the field and that in- 
vestigators read each other’s papers. Although there is clearly some force 
to this argument, to an important degree each new task attempted in 
heuristic programming represents an encounter with an unknown set of 
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mel? of Z7 r 7 77 *" to rcdufc who,e arcas a The develop- 

Z in ll*' n? r CaIcuIus - l!lter Laplace and Fourier transforms-is a 
Snd thZ / P ? SCnl t lcorcm *P rov,n S programs and the methods that lie 

metheH h ^m 11 ? mV ° V ° matI,ematical advances; rather they appear to capture 
methods available for proof discovery within the existing state of ignorance. 
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demands that have to be met on their own terms. Certainly the people 
who have created heuristic programs have often felt this way. In fact, 
the complaint is more often the opposite to the above caveat — that arti- 
ficial intelligence is a field full of isolated cases with no underlying 
coherency. 

In fact, the view expressed in this chapter is not widely held. There 
is some agreement that all heuristic theorem, provers and game players 
make use of a single scheme, called heuristic search. But there is little 
acknowledgment that the remainder of the methods listed below con- 
stitute some kind of basic set. 

With this prelude, let us describe briefly some methods. An adequate 
job cannot be done in a single chapter; it is more an undertaking for 
a textbook. Hopefully, however, some feeling for the essential char- 
acteristics of generality and power can be obtained from what is given. 
The first three, gencrate-and-test, match, and hill climbing, rarely occur 
as complete methods in themselves (although they can), but are rather 
the building blocks out of which more complex methods are composed. 

3.L Generate-and-Test 

This is the weak method par excellence. All that must be given is 
a way to generate possible candidates for solution plus a way to test 
whether they are indeed solutions. Figure 10.4 provides a picture of 
generate-and-test that permits us to view' it as a method W'ith a problem 
statement and a procedure. The flow' diagram in the figure adopts some 
conventions that will be used throughout. They allow' expresson of the 
central idea of a method without unnecessary detail. The lines in the 
diagram show the flow r of data, rather than the flow of control more 
in the style of an analog computer flow' diagram than a digital com- 
puter flow r diagram. Thus the nodes represent processes that receive 
inputs and deliver outputs. If a node is an item of data, as in the pre- 
dicate P or the set {x} (braces are used to indicate sets), it is a memory 
process that makes the data item available. A process executes (or fires) 
when it receives an input; if there are several inputs, it waits until all 
appropriate ones have arrived before firing. 

A generator is a process that takes information specifying a set and 
produces elements of that set one by one. It should be viewed as autono- 
mously “pushing” elements through the system. Hence there is a flow of 
elements from generate to the process called test. Test is a process that 
determines whether some condition or predicate is true of its input and 
behaves differentially as a result. Two different outputs arc possible: 
satisfied ( + ) and unsatisfied ( — ). The exact output behavior depends 
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Problem statement 

Given: a generator of the set {x} ; 

a test of the predicate P defined on elements of {x} ; 

Find: an element of {x} that satisfies P(x). 

Procedure 

P 

element input * + solution 

\x} — > generate > test > 

Justification 

To show y is a solution if and only if y £ {x} and P(y). 

Notation: x — * a means that process x produces a; 

a/i 3 means that a is on line labeled 0. 

Test has associated with it a predicate P on one variable, such that: 

test -4 a/+ if and only if P(a) and a/input; 
test a/— if and only if “| P(a) and a/input. 

Generate has associated with it a set {x} such that: 

* generate — > a/element only if a £ {x} ; 
a £ {x} implies there exists a time when generate — * a/element. 
Working backward from the flow line labeled solution, we get: 

1. y/solution if and only if test — » y/+. 

2. test —4 y/+ if and only if P{y) and y/ input. 

Now we need only show that y/input if and only if y £ {x} . 

3. y/input if and only if generate — > y/element. 

4. generate -4 y/element only if y £ {x} . 

Now we need only show that y £ {x} implies generate — > y/element; 
however, the best we can do is: 

5 . y £ {x} implies there exists a time when generate -4 y/elemcnt. 
Figure 10.4. Generate-and-test method. 


on the needs of the rest of the processing. -The input can be passed 
through on one condition and nothing done on the other, in which case 
test acts as a filter or gate. The input can be passed through in both 
cases, but on different output lines, in which case test acts as a binary 
switch. 

The set associated with a generator and the predicate associated with 
a test are not inputs. Rather, they are constructs in terms of which the 
behavior of the process can be described. This is done in Fig. 10.4 by 
listing a set of propositions for each process. The single arrow (— ►) indi- 
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cates production of an output, and the slash (/) indicates that a data 
item is on the line with the given label. Thus the first proposition under 
test says that test produces an item on its + output line if and only if 
that item was input and also satisfies the associated predicate. For any 
particular generator the associated set must be fully specified, but clearly 
that specification can be shared in particular ways between the structure 
of the generator and some actual inputs; for example, a generator could 
take an integer as input and produce the integers greater than the input, 
or it could have no input at all and simply generate the positive integers. 
The same situation holds for test or any other process: its associated 
constructs must be fully specified, but that specification can be shared 
in different ways between the structure of the process and some of its 
inputs. Sometimes we will put the associated construct on the flow dip- 
gram, as we have in Fig. 10.4, to show the connection between the proc- 
esses in the flow diagram and the constructs used in the statement of the 
problem. We use dotted lines to show that these are not really inputs, 
although inputs could exist that partially specify them. 

We have provided a sketch of a justification that the procedure of 
the method actually solves the problem. In substance the proof is trivial. 
To carry it through in detail requires formalization of both the procedure 
and the language for giving the problem statements and the properties 
known to hold if a process is executed [6]. The handling of time is a 
bit fussy and requires more formal apparatus than is worthwhile to pre- 
sent here. Note, for instance, that if the generator were not allowed to 
go to conclusion, generate-and-test would not necessarily produce a so- 
lution. Similar issues arise with infinite sets. Justifications will not be 
presented for the other methods. The purpose of doing it for this (sim- 
plest) one is to show that all the components of a method — problem state- 
ment, procedure, and justification— exist for these methods of artificial 
intelligence. However, no separate rationale is needed for generate-and- 
test, partly because of its simplicity and partly because of the use of a 
highly descriptive procedural language. If we had used a machine code, 
for instance, we might have drawn the procedure of Fig. 10.4 as an in- 
formal picture of what was going on. 

Generate-and-test is used as a complete method, for instance, in 
opening a combination lock (when in desperation). Its low power is 
demonstrated by the assertion that a file with a combination lock is a 
“safe.” Still, the method will serve to open the safe eventually. Generate- 
and-test is often used by human beings as a second method for finding 
lost items, such as a sock or a tiepin. The first method relies on recol- 
lections about where the item was left or last seen. After this has failed, 
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generate-and-test is evoked, generating the physical locations in the 
room one by one, and looking in each. 

The poor record of generate-and-test as a complete method should 
not blind one to its ubiquitous use when other information is absent. It 
is used to scan the want ads for neighborhood rentals after the proper 
column is discovered (to the retort “What else?”, the answer is, “Right! 
That’s why the method is so general”). In problem-solving programs it is 
used to go down lists of theorems or of subproblems. It serves to detect 
squares of interest on chessboards, words of interest in expressions, and 
figures of interest in geometrical displays. 

3J2. Match 

We are given the following expression in symbolic logic: 
e; (?V})D((pVg)v(rD p)) 

A variety of problems arise from asking whether e is a member of various 
specified sets of logic expressions. Such problems can usually be thrown 
into the form of a generate-and-test, at which point the difficulty of find- 
ing the solution is directly proportional to the size of the set. 

If we know more about the structure of the set, better methods are 
available. For instance, consider the following two definitions of sets: 

Si : xD(zV y ), where x and y are any logic expressions. 

Examples: pD (p V g),gD (j v g), (p v p) 3 ((p vp) v’p), . ... 

S 2 : a , where a may be replaced (independently at each occurrence) 

according to the following schemes: 

a 4— g, a <— (p V a), a <— aO a. 

Examples: q y p V q, q D g, p v (p v j), (p v 5) D (p v . 

In Si, x and y are variables in the standard fashion, where each occur- 
rence of the variable is to be replaced by its value. In S 2 we have defined 
a .replacement system, where each separate occurrence of the symbol a 
may be replaced by any of the given expressions. These may include 
a, hence lead to further replacements. A legal logic expression exists only 
when no as occur. 

It is trivial to determine that e is a member of the set of expressions 
defined by Si, and not so trivial to determine that it is not a member of 
the set defined by S*. The difference is that for Si we could simply match 
the expressions against the form and determine directly the values of the 
variables required to do the job. In the case of Su we had essentially 
to generate-and-test. (Actually, the structure of the replacement system 
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permits the generation to be shaped somewhat to the needs of the task, 
so it is not pure generate-and-tcst, which assumes no knowledge of tne 
internal structure of the generator.) 

Figure 10.5 shows the structure of the match method, using the same 
symbolism as in Fig. 10.4 for the generate-and-iest. A key assumption, 
implicit in calling A r and F expressions, is that it is possible to generate 
the subparts of X and F f and that X and F are equal if and only if cor- 
responding subparts arc equal. Thus there are two generators, which 
produce corresponding subparts of the two expressions as elements. These 
are compared: if equal, the generation continues; if not equal, a test is 
made if the element from the form is a variable. If it is, a substitution 
of the corresponding part of X for the variable is possible, thus making 
the two expressions identical at that point, and permitting generation to 
continue. The generators must also produce some special end signal, 
whose co-occurrence is detected by the compare routine to determine that 
a solution has been found. 

The match procedure sketched in Fig. 10.5 is not the most general 
one possible. Operations other than substitution can be used to modify 
the form (more generally, the kernel structure) so that it is equal to X . 
There can be several such operations with the type of difference between 
the two elements selecting out the appropriate action. This action can 


Problem statement 

Given: expressions made up of parts from a set S; 
a set of variables {t>} with values in 5 ; 
a form F, which is an expression containing variables; 
an expression X. 

Find: if X is in the set defined by F; that is, 

Find: values for (v) such that X = F (with values substituted). 


Procedure 

F > generate dement 

X * generate element 


/ 


compare ■ 

both 
end 


solution 


* (fe) 


M 


* test > substitute 


4-failure 


Figure 10.5. 


/ 


Match method. 
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. resu ^ * n modification of X as well as F. It is possible to write a single 
procedure that expresses these more general possibilities, but the detail 
does not warrant it. The essential point is that generation occurs on the 
parts of the expressions, and when parts fail to correspond it is possible 
to make a local decision on what modifying operation is necessary 
(though perhaps not sufficient) for the two expressions, to become equal. 

Matching is used so pervasively in mathematical manipulation, from 
algebraic forms to the conditions of a theorem, that our mathematical 
sophistication leads us not to notice how powerful it is. Whenever a set 
of possible solutions can be packaged as a form with variables, the search 
for a solution is no longer proportional to the size of the set of all pos- 
sible solutions, but only to the size of the form itself. Notice that the 
generate process in generate-and-test (Fig. 10.4) operates on quite a 
different set from the generate of the match (Fig. 10.5) . 

Beside the obvious uses in proving theorems and doing other mathe- 
matics, matching shows up in tasks that seem remote from this discipline. 
One of theni, as shown below, is inducing a pattern from a part. Another 
use is in answering questions in quasi-natural language. In the latter 
information is extracted from the raw text by means of forms, with the' 
variables taking subexpressions in the language as values. 

3.3. Hill Climbing 

The most elementary procedure for finding an optimum is akin to 
generate-and-test, with the addition that the candidate element is com- 
pared against a stored element — the best so far — and replaces it if higher. 
The element often involves other information in addition to the position 
m the space being searched, for example, a function value. With just a 
little stronger assumptions in the problem statement, the problem can be 
converted into an analog of climbing a hill. There must be available a 
set of operators that find new elements on the hill, given an existing ele- 
ment. That is, new candidate elements are generated by taking. a step 
rom the present position (one is tempted to say a “nearby” step, but it 
is the operators themselves that define the concept of nearness). 'Thus 
the highest element so far plays a dual role, both as the base for gener- 
ation of new elements and as the criterion for whether they should be 
kept. 

. Fi ? urc 10.0 provides the capsule formulation of hill climbing. Gener- 
ation is over the set of operators, which are then applied to the best i so 
far, until a better one is found. This method differs from the various 
forms of steepest ascent in not finding the best step from the current 
position before making the next step. 
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Problem statement 

Given: a comparison of two elements of a set {x} to determine which is 

greater; . . . . 

a set of operators {5} whose range and domain is (x/ 

[i.e., 3(1) = x', another element of {x}]. 

Find: the greatest x £ {x}. 


Procedure 
{5} — •» generate - 


operator , ** 

> apply 


* / >x 


>ply > compare 1 

1 1 1 


best so far 


Figure 10.6. Hill climbing. 

A great deal has been written about hill climbing, and the interested 
reader will find a thorough discussion within the context of more elabo- 
rate methods for finding optima in Chapter 9 on structured heuristic 
programming. Here we note only the familiar fact that the method does 
not guarantee to find the best element in the space. An additional con- 
dition, unimodality, is required; otherwise the procedure may end up 
on a local peak, which is lower than the highest peak. Actually, uni- 
modality is not easy to define in the most general situation to which hill 
climbing applies, since the underlying space (which is ‘ seen” only through 
the operators) need not have neighborhoods in the sense required to de- 
fine a local peak. . . 

Hill climbing shows up in a subordinate way in many heuristic pro- 
grams, especially in the adaptive setting of parametric values. For ex- 
ample, in one program that attempted to construct programs satisfying 
certain criteria [9], the various basic instructions were selected at random 
and used to extend the program built so far. The entire program was an 
elementary form of heuristic search, discussed in Section 3.4. But super- 
imposed on it was a hill-climbing program that gradually modified the 
probabilities of selecting* the basic instructions so as to maximize the 
yield of programs over the long haul. The operators randomly jiggled 
the selection probabilities around (always maintaining their sum equal 
to one). The comparison was made on a statistical basis after observing 
the performance with the new probabilities for, say, 100 randomly se- 
lected problems. 

Management science is much concerned with seeking optima, although, 
as mentioned above, the methods used are more elaborate. This can be 
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illustrated by a heuristic program developed by Kuehn and Hamburger 
[11] for locating warehouses so as to balance the costs of distribution 
(which decrease with more warehouses) and the costs of operation (which 
increase with more warehouses). The program consists of three separate 
optimizers: a cascade of a generate-and-test optimizer and a steepest 
ascent optimizer, followed by a simple hill climber, followed by a set of 
simple generate-and-test optimizers. Figure 10.7 gives the problem state- 
ment (leaving out details on the cost functions) and an indication of 
how the problem is mapped into the problem space 4 for optimization. 
Three operators are defined, corresponding to the three separate stages 
already mentioned. The procedure is given for the first stage (called the 
Main Routine), but not for the other two (called the Bump and Shift 
Routine) . 

The elements of the problem space consist of all subsets of warehouses, 
taken from a list of possible sites (which is a subset of the total set of 
sites with customer demand). The program builds up the set of ware- 
houses by the operation of adding one warehouse at a time. The actual 
data structure corresponding to the element consists not only of the list 
of warehouses but also the assignment of each customer to a warehouse, 
the partial cost borne by that warehouse, and the total cost of operating 
(TC). (That incremental cost calculations are easier to make than calcu- 
lations starting from scratch is an important aspect of the efficiency of 
programs such as this one.) The main part of the program simply con- 
siders adding new warehouses (i.e., taking steps in the problem space) 
and comparing these against the current position on total cost. It is a 
steepest ascent scheme, since it goes through the whole set and then picks 
the best one. The additional wrinkle is to eliminate from the set of un- 
used warehouses any whose costs are less than the current position, thus 
depleting the set to be considered. In fact, the first stage terminates 
when this set becomes empty. 

The operator generator delivers only a fixed subset of all possible 
warehouse sites. It does this by a simple hill-climbing scheme whereby 
the best n sites are chosen from the total set on the basis of local cost 
(LC), which is the cost savings to be made from handling the local de- 
mand at the same site as the warehouse (n is a parameter of the pro- 
gram). This cascading of two optimizers keeps the second one from 
becoming excessively costly. 

The next two stages (the Bump and Shift Routine) make minor 

‘We often use the term problem space to refer to the set of potential solutions 
as defined by the problem statement of a method. It includes the associated. oper- 
ations for moving around in the space. 
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Problem statement 

Given: a set of customers, {c}, with locations and sales volume, 
a set of factories, {/}, with locations; 

a set of warehouse sites, {to} , with transportation costs to cus- 
tomers and to factories, and operating costs. 

Find: a set of warehouses that minimizes total costs. 


Problem space for hill climbing 

Elements: {x|x is a subset of {w}}, 

for each x can compute TOix). 

Initial element: the null set. 

Desired element: the element with lowest 1 C. 

Operators: 1. Add to to x; 

2. delete to from x; 

3. move to £ £ to location of customer of to. 

Note: all these permit incremental calculation of TC, since 
only paired comparisons with existing to for each customer 
affected are required. 


Procedure for stage 1 (operator 1) 


M 

-r 


generate - 
operators 

| end 


(<» times) 


-*■ apply 


• delete 

^ greeter TC 

-* compare * {x|next moves) 

i 


I 


= {iu| so far} 


select 
* greatest 


Generate operators: 

{to} > generate — 

i end 

TC(x) = total cost of x to supply all customers. 

LC(to) = local cost of to to supply customers at location of to. 


LC KLCu * ^ 

-> compare > • • • > w u 

T i 

Wn < — select nth 


► generate—* 


Figure 10,7. Warehouse location heuristic program. 


adjustments in the clement (the set of warehouses) that results from the 
first phase. Warehouses are successively eliminated if they fail to pay 
for themselves. This is hill climbing if the order of elimination affects 
subsequent costs; otherwise it is simply generating through the parts of 
the system, making local changes. Note that no new warehouses are 
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added to the solution element after deletion. Finally, as the last stage, 
each warehouse is moved around locally in its own territory to find the* 
most advantageous location. This stage constitutes a scries of independent 
generate-and-test optimizations, since no interaction between warehouses 
is involved. 


3.4. Heuristic Search 


The best-known method in heuristic programming is the one whereby 
the problem is cast as a search through an exponentially expanding space 
of possibilities— as a search which must be controlled and focused by the 
application of heuristics. All of the game-playing and theorem-proving 
programs make use of this method, as well as many of the management 
science applications [13 J. 5 

Figure 10.8 gives the most elementary variant of the method. It as- 
sumes a space of elements, the problem space, which contains one ele- 
ment representing the initial position, and another representing the final 
or desired position. Also available is a fixed set of operators, which when 
applied to elements in space produce new elements. (Operators need not 
always be applicable.) The problem is to produce the final desired po- 
sition, starting at the initial one. 

With only this information available, the method involves a search 
that expands in a tree-like fashion. The initial element x 0 is the initial 
current position; operators are selected and applied to it; each new ele- 
ment is compared with x„ to see whether the problem is solved; if not, 
it is added to a list of obtained positions (also called the "try list” or 
the ‘subproblem list”) ; and one of these positions is selected from which 
to continue the search. If about B of the operators applied are applicable 

to an obtained position, about B D elements will have been reached after 
D steps. 

The search is guided (i.e., the tree pruned) by appropriate selection 
and rejection of operators and elements. The flow diagram provides a 
scheme upon which the various possibilities can be localized. The most 
elementary ones are unconditional: a rule for operator selection or for 
element selection. The latter is often accomplished by keeping an ordered 
list of elements and simply selecting the first one on the list; hence the 
order is dictated by the insertion process. The simplest rules have been 
given names. Thus, if the list is last-in-first-out (so that insertion is al- 


. * t gamG pl f- vers arc exceptions. They use a recognition method that learns 

° Cla „ e t0 cach gamo Position (or class of positions) a good move. Although 
eoretically capable of handling complex games through the development of an ap- 
propriate classification, this method has not been used in any but simple games 
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Problem statement 

Given: a set {x}, the problem space; 

a set of operators {5} with range and domain in {x} ; 
an initial element, xo; 
a desired element, xj. 

Find : a sequence of operators, q iy qt , . . . , g», such that they transform 
Xo into Xd : 

— 1 * • • 5 l(^o) • * ™ Xd 

Procedure 


Basic: 

{g} — ► select 


Means-end: 


{g} — ► select ■ 

difference 


■ compare <- 
Xd 


apply ■ 


x <— select 


f&ilure 


Xd 

I criterion 
+ + 
test 

. i- 

insert 

i 

• {x|obtained} 


apply • 


Xd 

I criterion 


* compare — 

insert 

i 

- select {x|obtained} 


l 


failure 


solution 


solution 


Figure 10.8. Heuristic search method. 


ways at the front), the resulting search is depth first. This scheme was 
used by almost all early game-playing programs. If the list is first-in- 
first-out (so that insertion is always at the end) , the resulting search is 
breadth first . This scheme has been used by some theorem provers. 

An element may be completely rejected at the time of insertion. One 
of the most frequent heuristics is to reject if the element is already in 
the obtained list, that is, to avoid duplication. (This is a heuristic, since, 
though always beneficial, it requires extra effort and memory; thus it 
may not pay compared to, say, additional search.) Information already 
available in the scheme can be used; for instance, a rather important 
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variant is called means-ends analysis. The current element x is com- 
pared with the desired element x dl and the resulting difference is used to 
select the operator, that is, the operator (means) is selected with a view 
toward the end. The flow diagram for this variant is given in the figure 
below the basic heuristic search. 

The situation described in Fig. 10.8 is overly simple in several re- 
spects. Initially, a set of elements may be given, rather than just one, 
with the search able to start from any of them. Likewise, finding any 
one of a set of elements can be desired, rather than just finding a single 
one. This final element (or set) can be given by a test instead of by an 
element, although this has consequences for variants such as means-ends 
analysis. More important, the operators need not involve only a single 
input and a single output. In logic, for instance, important rules of 
inference, such as modus ponens , take two inputs and deliver a single 
output: from a and a D b infer 6. Thus we can have multiple inputs for 
an operator. Similarly, if one is working backwards in logic (that is, 
from the conclusion to premises that make this conclusion follow), modus 
ponens shows up as an operator that has a single input but a set of out- 
puts: to get 6, prove a and a D b. Furthermore, all of the outputs must 
be obtained; thus independent subproblems must radiate from each ele- 
ment of the output set in order to solve the problem. 

Figure 10.9 shows one of the early theorem provers, LT (the Logic 
Theorist), which worked on elementary symbolic logic [15]. It is a 
heuristic search, using a breadth first strategy. However, it also uses 
generate-and-test and match, so that, like the warehouse location pro- 
gram, it has a composite structure. Comparison of LT with the basic 
heuristic search method will show that there are two unexpected twists 
to formulating the task of theorem proving for heuristic search. First, 
the problem has to be turned around, so that LT works backward from 
the original goal toward the given theorems. Thus the rules of inference 
must be expressed in an inverse sense. Second, the assumed theorems 
enter into the task both in ttie generation of operations and in the test. 
This actually reflects a restriction to the generality of LT, since it insists 
that one of the two expressions in the backward rules of inference be 
tied immediately to a theorem, rather than being a subproblem which 
need only make contact eventually. 

The only elaborations of LT from the basic heuristic search procedure 
are the insertion of a test for similarity between t and x before trying to 
apply the operator, and the rejection of duplicate elements, which re- 
quires keeping a list of the elements already tried. The test for solution 
is elaborated in the minimal gencratc-and-test way to take into account 


Problem statement 

Given: the rules of inference of propositional logic; 
a set of theorems, {t}, assumed valid; 
a theorem, x 0 . 

Find: a proof of x 0 from the assumed theorems. 

Problem space for heuristic search 

Elements: logic expressions, {x}. 

Initial element: theorem to be proved, xo- 
Desired elements: any of the assumed theorems, {<}• 

Operators: pairs (rn, x), where t is any assumed theorem and m is an 
expression of the rules of inference, working backwards: 

MDt (Detachment): (t:aZ> b,x:b) —*a • 

MChF (Chaining forward): (t:a 3 b, x:a Z> c) —*b O c 

MChB (Chaining backward) : (t:a Z) b, x:d Z) b) —*■ d Z> a 


Procedure 

{ f- 

generate — 
operators 


(«,<) 


test - 


apply- 


V *r 4. 

► test if 


solution 


similarity 


theorem 


test 


dup' 


icate 


yr 

insert 

x<— generate <— {x|untried}— I 

Li 


insert — * {x|tried}- 


(m t t) 


Generate operators: {MChB, MChF, MDt} ►generate ►generate- 

<0 

r 

Test if , 1 + 

theorem: {<} -* generate -> match * 

Apply: match + -> construct — — * 

output 

Match: operations are substitution and the definition: a Z3 b = ~a v 6 
Figure 10.9. LT: Logic Theorist. 
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that any of the set of theorems will do. Although we have not shown 
the internal structure of the match, it docs use definitions as well as 
substitutions to make a theorem and an expression the same. 

3.5. Induction 

Figure 10.10 shows a task that clearly involves inductive reasoning: 
you are to determine which of the figures 1-5 bears the same relationship 
to figure C as figure B does to figure A [4]. Similar problems exist in 
extrapolating series [22] : for example, what is the blank in abcbcdcd_1 


ANALOGIES 



Figure 10.10. Analogies task. 
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Another similar task is to discover a concept, given a sequence of items, 
some of which exemplify the concept whereas others do not [8]: for 
example, if xoxox, xoxxo, oxoxo are positive instances and xooxo, xxoox, 
xoooo are negative instances, what is oxxxo? 

Computer programs have been constructed for these tasks. They show 
a certain diversity due to the gross shape of the task; that is, the task of 
Fig. 10.10 gives one instance of the concept (A:B) and five additional 
possible ones {C:l, C:2,..., C:5}, whereas the series provides a long 
sequence of- exemplars if one assumes that each letter can be predicted 
from its predecessors. However, most of the programs use a single method, 
adapted to the particular top-level task structure. 0 Figure 10.11 gives the 
method, although somewhat more sketchily than for the others. 

The first essential feature of the method is revealed in the problem 
statement, which requires the problem to be cast as one of finding a 
function or mapping of the given data into the associated (or predicted) 
data. The space of functions is never provided by the problem poser 
certainly not in the three examples just presented. Often it is not even 
clear what the range and domain of the function should be. For the 
series extrapolation task, to view {x:y} as { a:b , ab:c, abc:b,..., 
abebeded is already problem solving. Thus the key inductive step 
is the assumption of some space of functions. Once this is done the 
problem reduces to finding in this space one function (or perhaps the 
simplest one) that fits the exemplars. 

The second essential feature of the method is the use of a form or 
kernel for the function. This can be matched (in the sense of the match 
method) against the exemplars. Evidence in the items then operates 
directly to specify the actual function from the kernel. Implicit in the 
procedure in Fig. 10.11 is that, inside the match, generation on the kernel 
(refer back to Fig. 10.5) produces, not the parts of the kernel itself, but 
the predictions of the y associated with the presented x. However, parts 
of the kernel expression must show through in these predictions, so that 
the modification operations of the match can specify or modify them in 
the light of differences. When the kernel expression actually has variables 
in it, the prediction from the kernel is sometimes a variable. Its value 
can be made equal to what it should be from the given x:y, and thus 
the kernel expression itself specified. Often the modification operations 
arc like linguistic replacement rules, and then the matter is somewhat 
more complex to dusci ibc. 

•Most, but not nil. Several adapt the paradigm used for pattern recognition 
programs. In addition, a method called the method of successive differences is ap- 
p livable to series extrapolation where the terms of the series arc expressed as numbers. 
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Problem statement 

Given: a domain {x} ; 
a range {y} ; 

a generator of associated pairs {x^y}. 

Find: a function / with domain {x} and range {y} such that/(x) = y 
for all {x:y}. 

Additional assumption (almost never given with the problem statement, 
and therefore constituting the actual inductive step) : 

Given: a set of functions {/} constructable from a set of kernel forms {&}. 

Procedure 

| | / (succeed) 

{&} ' generate ♦ k match 

•tart 

(x:y) * generate — ' 


solution (all xvy work) 


Figure 10,11* Induction method. 


It is not often possible to express the entire space of functions as a 
single form (whence a single match would do the job). Consequently a 
sequential generation of the kernels feeds the match process. Sometimes 
clues, in the exemplars are used to order the generation; more often, 
generation is simply from the simplest functions to the more complex. 

This method is clearly a version of “hypothesis-and-test.” However 
the latter term is usee} much more generally than to designate the class 
of induction problems handled by this method. Furthermore, there is 
nothing in hypothesis-and-test which implies the use of match; it may 
be only generate-and-test. Consequently, we choose to call the method 
simply the induction method, after the type of task it is used for. 

3.6. Summary 

The set of methods just sketched — generate-and-test, hill climbing, 
match, heuristic search, and induction — constitutes a substantial fraction 
of all methods used in heuristic programming. To be sure, this is only p, 
judgment. No detailed demonstration yet exists. Also, one or two im- 
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portant methods are missing; for example, an almost universally used 
paradigm for pattern recognition. 

Two characteristics of the set of methods stand out. First, explicit 
references to processes occur in the problem statement, whereas this is 
not true of mathematical methods, such as the simplex method. Thus 
generate-and-test specifies that you must have a generator and you must 
have a test; then the procedure tells how to organize these. This feature 
seems to be related to the strength of the method. Methods with stronger 
assumptions make use of known processes whose existence is implied by 
the assumptions. In the simplex method generation on a set of variables 
is done over the index, and the tests used are equality and inequality on 
real numbers. Hence there is no need to posit directly, say, the generator 
of the set. 

The second characteristic is the strong similarity of the methods to 
each other. They give the impression of ringing the various changes on 
a small set of structural features. Thus there appear to be only two 
differences between heuristic search and hill climbing. First, it is neces- 
sary to compare for the greater element in hill climbing; heuristic search 
needs only a test for solution (although it can use the stronger compari- 
son, as in means-ends analysis). Second, hill climbing keeps only the 
best element found so far, that is, it searches the problem space from 
where it is. Heuristic search, on the other hand, keeps around a set of 
obtained elements and selects from it where next to continue* the search. 
In consequence, it permits a more global view of the space than hill 
climbing— and pays for it, not only by extra memory and processing, but 
also by the threat of exponential expansion. 

Similarly, the difference between match and heuristic search is pri- 
marily one of memory for past actions and positions. Our diagram for 
match does not reveal this clearly, since it shows only the case of a single 
modification operation, substitution; but with a set of modification oper- 
ations (corresponding to the set of operators in heuristic search) the 
match looks very much like a means-ends analysis that never has to 
back up. 

Finally, the more complex processes, such as LT and the warehouse 
program, seem to make use of the more elementary ones in recognizable 
combinations. Such combination does not always take the form of dis- 
tinct units tied output to input (i.c., of closed subroutines) , but a flavor 
still exists of structures composed of building blocks. 

In reviewing these methods instruction in the details of artificial intel- 
ligence has not been intended. Hopefully, however, enough information 
has been given to convince the reader of two main points: (1) there is in 
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heuristic programming a set of methods, as this term was used in the 
beginning of the paper; and (2) these methods make much weaker de- 
mands for information on the task environment than do methods such 
as the simplex, and hence they provide entries toward the lower, more 
general end of the graph in Fig. 10.3. 

4. THE CONTINUNITY OF METHODS 

If the two hypotheses that we have stated are correct, we should 
certainly expect there to be methods all along the range exhibited in Fig. 
10.3. In particular, the mathematical methods of management science 
should not be a species apart from the methods of artificial intelligence, 
but should be distinguished more by having additional constraints in the 
problem statement. Specific mathematical content should arise as state- 
ments strong enough to permit reasonable mathematical analysis are 
introduced. 

Evidence for this continuity comes from several sources. One is the 
variety of optimization techniques, ranging from hill climbing to the 
calculation methods of the differential calculus, each with increasing 
amounts of specification. Another is the existence of several methods, 
such as so-called branch and bound techniques, that seem equally akin to 
mathematical and heuristic programming. Again, dynamic programming, 
when applied to tasks with little mathematical structure, leads to pro- 
cedures which seem not to differ from some of, the methods in heuristic 
programming, for example, the minimax search techniques for playing 
games such as chess and checkers. 

What we should like most of all is that each (or at least a satisfactory 
number) of the mathematical methods of management science would lie 
along a line of methods that extends back to some very weak but general 
ancestors. Then, hopefully, the effect on the procedure of the increasing 
information in the problem statement would be visible and we could see 
the continuity directly. 

As a simple example, consider inverting a matrix. The normal algo- 
rithms for this are highly polished procedures. Yet one can look at in- 
version as a problem — as it is to anyone who does not know the available 
theory and the algorithms based on it. Parts of the problem space are 
clear: the elements are matrices (say of order n), hence include both the 
given matrix, A, the identity matrix, I, and the desired inverse, X. The 
problem statement is to find X such that AX = I. Simply generating and 
testing is not a promising way to proceed, nor is expressing X as a form, 
multiplying out, and getting n 2 equations to solve. Not only are these 
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poor approaches, but also they clearly are not the remote ancestors of the 

existing algorithms. . _ 

If the inverse is seen as a transformation on A, carrying it into I, a 
more interesting specification develops. The initial object is A, the de- 
sired object is I, and the operators consist of premultiplication (say) by 
some basic set of matrices. Then, if operators Ei, E 2 , • • • > transform 
A into I, we have E* • • • E 2 EiA = I, hence E* • • • E 2 EjI — A .If the 
basic operators are the elementary row operations (permute two rows, 
add one row to another, multiply a row by a constant), we have the 
basic ingredients of several of the existing direct algorithms (those that 
use elimination rather than successive approximation). These algorithms 
prescribe the exact transformations to be applied at each stage, but if 
we view this knowledge as being degraded we can envision a problem solver 
doing a heuristic search (or perhaps hill climbing if the approach were 
monotone). Better information about the nature of the space should 
lead to better selection of the transformation until existing algorithms are 
approached. 

4. 1 . An Example: the Simplex Method 

The simplex method clearly involves both optimization and search, 
hence should eventually show kinship with the methods that we have 
been describing. We should be able to construct a sequence of methods, 
each with somewhat less stringent conditions and therefore with more 
general applicability but less power. Power can be measured here by 
applying the method to the original linear programming (LP) problem, 
where the true nature of the problem is known. 

Figure 10.12 reformulates the LP problem and the simplex algorithm 
in the present notation. Indices have been suppressed as much as possible, 
partly by using the scalar product, in the interests of keeping the figures 
uncluttered. The problem statement for the simplex method, which 
we call SM from now on, is a special case of the LP problem, having 
equalities for constraints rather than inequalitities, but involving n + m 
variables rather than just n. The transformation between problems is 
straightforward (although the original selection of this specialization is 
not necessarily so) . 

The elements of the problem space (called bases) are all the subsets 
of m out of the n + m variables. An element consists of much more 
information than just the subset of variables, of course, namely, of the 
entire tableau shown in Fig. 10.2. The operators theoretically could be 
any rules that replace one subset of in variables with another. In fact, 
they involve adding just a single variable, hence removing one. This 
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Problem statement for LP problem 

Given: a set of n variables, {x}, where each x ^ 0: 
let x be the n-tuple (xi, x t , ... , x„) ; 
a set of m constraints, {g = b — 32} : 
let the feasible region be (x| g ^ 0} ; 
an objective function, z — cx. 

Find: x in the feasible region such that z is maximum. 

Problem statement for SM, the simplex method 

Given: a set of n + m variables, {x}, where each x ^ 0: 
let x be the (n + m)-tuple (x,-, x t , . . . , x„ +m ); 
a set of m constraints, {g = b — 22} : 
let the feasible region be {x\g = 0} ; 
an objective function, z = cx. 

Find: x in the feasible region such that z is maximum. 

Note: any LP problem can be solved if this one can. It is a separate 
problem to provide the translation between them (define x n+l - = g { 
and determine c and the 3 accordingly). 

Problem space for SM 


Elements: { B (bases), the ^ subsets of m variables from {x}} ; 

with B is associated T(B ) (the tableau) containing: 
a feasible x such that x £ B implies x > Oj otherwise x = 0 ; 
the current value of z for x; 

the exchange rate (e) for each x [-(2 - c) in tableau]; 
auxiliary data to permit application of any operator. 
Initial element: B 0 , a feasible basis (not obtained by SM). 

Operators: {x not in B) . 

Proced me 


f Kklution 


(x not in B } ► select • 

L 


f • unbounded 


apply • 


(B, 


T, 


m 


Select (pick x with maximum e ) : 

(x not in 5 } ► generate — ► compare (x. e) 


Figure 10.12. SM : reformulation oi simplex method. 
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would still leave (n - m)m operators (any of n - m in, any of m out), 
except that no choice exists on the one to be removed. Hence, there are 
just n — m operators, specified by the n — m variables not in the 
current basis. Applying these operators to the current element consists of 
almost the entire calculation of the simplex procedure specified in Fig. 
•10.2 (actually steps 2, 3, and 4), which amounts to roughly m{n + m) 
multiplications (to use a time-honored unit of computational effort). 

The procedure in Fig. 10.12 looks like a hill climber with the compar- 
ison missing: as each operator is selected, the new {B,T) is immediately 
calculated (i.e., the tableau updated) and a new operator selected. No 
comparison is needed because the selection produces only a single operator, 
and this is known to advance the current position. The procedure for 
selecting the operator reveals that the process generates over all potential 
operators — over all x not in the current basis — and selects the best one 
with respect to a quantity called the exchange rate (e). Thus the select 
is a simple optimizer, with one exception (not indicated in the figure): 
it is given an initial bias of zero, so that only operators with e > 0 have 
a chance. 

The exchange rate is the rate of change of the objective function (z) 
with a change in x , given movement along a boundary of the constraints, 
where the only variables changing are those already in the basis. Given 
this kind of exploration in the larger space of the n + m variables, e 
measures how fast z will increase or decrease. Thus, e > 0 guarantees 
that the compare routine is not needed in the main procedure. 

The selection of an x with the maximum exchange rate does not 
guarantee either the maximum increase from the current position or the 
minimum number of steps to the optimum. The former could be achieved 
by inserting a compare routine and trying all operators from the current 
position; but this would require many times as much effort for (probably) 
small additional gain. However, since the space is unimodal in the feasible 
region, the procedure does guarantee that eventually an optimum will 
be reached. 

We need a method at the general end of the scale against which the 
SM can be compared. Figure 10.13 gives the obvious one, which we call 
Ml. It retains the shape of the problem but without any specific content. 
The problem is still to optimize a function, /, of n positive variables, sub- 
ject to m inequality constraints, {g}. But the only thing known about / 
and the g is that / is unimodal in the feasible set (which accounts for 
its descriptive title). This justifies using hill climbing as the procedure. 
The operators must be any increments to the current position, either 
positive or negative. Many will produce a location outside the feasible 
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Problem statement for Ml, the unimodal objective method 

Given: a set of n variables, {x}, where each x ^ 0: 
let x be the n-tuple ( x lf z 2 , . . . x n ); 
a set of m constraints, {g(x ) ^ 0} : 
let the feasible region be {x | g ^ 0} ; 
an objective function z = /(x); 

/ is unimodal in the feasible region. 

Find: x in the feasible region such that z is maximum. 

Problem space PS1, for hillclimbing 

Elements: {x}. 

Initial element: x<>, a feasible solution (not obtained by Ml). 

Operators: {Ax, where each Ax is any real number} , 
and x* = x + Ax. 

Procedure 


{Ax} ■* generate ► teat > apply > compare • 

1 




(?} 


‘(z, x) 


Figure 10.13. Ml: unimodal objective method. 


region, but these can be rejected by testing against the g. The procedure 
in the figure does not provide any information on how to generate the 
operators. 


If Ml is compared to. SM, several differences are apparent. First, and 
most striking, the problem space for Ml is n-dimensional Euclidean space, 


whereas for SM it is a finite set of 


or) 


points in (n -f m) -dimen- 


sional space. Thus the search space has been drastically reduced, inde- 
pendently of what techniques are used to search it. Second, and almost as 
striking, Ml has all of {Ax} as operators (i.e., all of n-dimensional space 
again), whereas SM has only n — m operators. Third, a unique operator is 
selected for application on the basis of partial information; it always 
both applies and improves the position. In Ml there is no reason to 
expect an operator either to produce a feasible solution or, if it does, to 
obtain an improvement; thus, extensive testing and comparing must be 
done. Finally, we observe that the cost per step in Ml (when applied to 
the same LP problem as SM) is mk ) where k is the number of variables 
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in Ax and would normally be rather small. Compared to m(m + n) for 
SM, this yields the one aspect favoring Ml. One can take about 
( m + n ) /fc steps in the space of Ml for each step in the space of SM. 
However, the thrashing around necessary at each point to obtain a positive 
step will largely nullify this advantage. . , 

This list of differences suggests constructing a sequence of methods 
that extend from Ml to SM, with decreasing spaces and operators and 
increasing cost per step (to pay for the additional sophistication). Much 
of the gain, of course, will come without changing problem spaces, but 
from acquiring better operator selection. There may be relatively few 
substantial changes of problem space. Figures 10.14 and 10.15 provide 

Problem statement for M2, the monotone objective method 

Given: the conditions of Ml, plus 

/ is monotone in the feasible region. 

Find: x in the feasible region such that z is maximum. 

Problem space PS2, main hill climbing 

Elements: { x on boundary of feasible region (at least one x = 0 or g = 0)} . 
Initial element: x 0 in feasible region (not obtained by M2). 

Operators: {x}, where x' = the point on boundary given by M2*. 

Problem statement for M2% M2-operator method 

Given: the conditions of M2, plus 
x is on the boundary ; 
x £ {x}. 

Find: x on the boundary such that 
Ax to increase z not feasible; 
all other x unchanged; 
z increased. 

Additional assumption for efficiency: 

g(x) = 0 can be solved for any x with all other x fixed. 

Problem space for M2% hill climbing 

Elements: {x}. 

Initial element: x, given by M2 operator. 

Operators: {Ax, with appropriate sign} , where x' = x + Ax. 

Problem space PS1 for Ml 
Used as backup when PS2 terminates without optimum. 

Figure 10.14. M2: monotone objective method. 
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Problem statement for M3, the consistent exchange problem 

Given: the conditions of M2, plus 

if an r is exchanged for other variables by moving along a maxi- 
mal boundary, then Az has a consistent sign. 

Find: x in the feasible region such that 2 is a maximum. 

Problem space PS3, main hill climbing 

Elements: {x on the maximal boundary of feasible set; i.e., no x can be 
changed to increase 2 , holding other x fixed}. 

Initial element: xo, a feasible solution on the maximal boundary (not 
; obtained by M3). 

Operators: {x}, where x' = the point on the maximal boundary riven 
by M3*. 

Problem statement for M3*, M3-operator method 

Given: the condition of M3, plus 

x is on a maximal boundary; 
fG{x}. 

Find: x on the maximal boundary, such that exchange for x to increase 
2 is not feasible; 

2 increased. 

Additional assumption for efficiency: 

any system of k equations, {ff(x) = 0} , can be solved for any set 
of k variables with the others fixed. 

Problem space for M3* 

Elements: {x} . 

Initial element: x, given by M3 operator. 

Operators: {Ax, with appropriate sign} and {Ax, subsets of {x}}, where 
s' = x + Ax. 

Figure 10.15. M3: consistent exchange method. 


two that seem to reflect some of the major boundaries to be crossed in 
getting from Ml to SM. 

Figure 10.14 shows method M2, which adds the assumption that the 
objective function,/, is monotone in the feasible region. In the LP problem 
dj/Ox is constant, but this is not required to justify the main conclusion; 
namely, that if a given change in a varible is good, more change in the 
same direction is better. The effect of this is to create new operators and, 
through them, a new space. The basic decision is always to drive a varia- 
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ble to a boundary (in the direction of increasing 2 , of course). Thus the 
space for M2 becomes the boundary set of the original space i for Ml 
(those points where at least one of the constraints, including the x ^O, 
attains zero). The operators in the space of M2 are full steps to .a 
boundary (what are sometimes called macroinoves m artificial mte 1 - 
gence). Now, finding the boundary is still a problem, although a more 
manageable one. Thus M2 has a second problem method, M2 , for this 
purpose. As described in Fig. 10.14 it can be a simple hill climber. 

An additional strong assumption has been made in the procedure of 
M2, namely, that only changes in a single variable, x, will be considered. 
This reduces the number of operators, as well as making the operator 
submethod M2* simpler. It is not justified by the assumptions of the 
problem statement, however, and consequently M2 will terminate at 
suboptimal positions where no single variable can be changed to ^crease 
z without decreasing some other variables. (This is often called the 
maximal or the Pareto optimum set.) Rather than relax the operators to 
a wider class, the original method, Ml, is held in reserve to move off the 
maximal set. (However, if done with small steps, this is extremely 
inefficient for the LP problem, since the system just jitters its way slowly 


up a bounding plane.) . . 

The description of M2* gives an additional assumption: each equation 
g (f)can be solved directly for any x, given that the values of the other x s 
are determined. This permits direct calculation of the extreme value of 
x on a boundary that is maximal. Slightly stronger conditions on the gs 
allow determination of the first constraint to become binding, withou. 

multiple evaluations. ...... 

Figure 10.15 shows method M3, which adds the assumption that the 
exchange rate (e) always has a consistent sign as one moves along t e 
feasible region, in response to introducing a variable x (what we have 
called exchanging). Again, in the LP problem e is constant, but this 
is not required to justify the main conclusion: that in a maximal situa- 
tion, if adjustments are made in other variables to allow a par ic u ar 
variable to increase, and the gain from the exchange is positive, it will 
always be positive; hence the new variable should be exchanged for as 
much as possible, namely, until another boundary is reached. This 
assumption not only allows a better way of dealing with the maximal 
cul-de-sac than does M2, with its regression to Ml, but also permits 
the problem space to be changed radically a second time. 

The elements of the space now become the set of maximal points, 
thus a small subset of the space of M2. The operators remain the 
same ; the individual variables. The application of an operator again cor- 
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responds to the solving of a subproblem, hence is accomplished by a 
submethod, M2*. The problem is as follows: given x (with a positive 
exchange rate), to advance it as far as possible. This means solving the 
constraints simultaneously for the variables, so as to remain on a bound- 
ary. As a change in the selected x is made, the current x moves off the 
maximal boundary by violating either the constraints or maximality. Ad- 
justments must be made in the other x's to restore these conditions. 
What the new clause in the problem statement provides is not a way of 
making the adjustments, but a guarantee that if a change is once found 
that does increase z (after adjustment) it should be pushed to the limit. 

We have not described M3*, other than to indicate the available 
operators. At its most general (i.e., assuming no other information), it 
requires a two-stage process, one to discover a good direction and the 
other to push it. The latter is again a two-stage process, one to change 
the selected x and the other to make the adjustments. We have included 
an additional assumption, similar to the one for M2*, that a direct way 
exists of solving systems of contraints for some variables in terms of 
others. This clearly can make an immense difference in the total efficiency 
of problem solving but does not alter the basic structuring of the task. 

M3 is already a recognizable facsimile of SM. The space has been 
cut to all subsets of the variables, although the final contraction to sub- 
sets of 77i variables has not occurred. (It is implicit in the problem 
statement of M3, with some mild conditions on the g* s, but has not been 
brought out.) The operators of M3 and SM are the same. More pre- 
cisely, they are isomorphic — the process of applying an operator is quite 
different in the two methods. There are still some steps to go. The kinds 
of methods that are possible for the operator need explication. They are 
represented in M3* only by the assumption that systems of equations 
can be solved. But the schemes in SM use special properties of linear 
systems. Similarly, we have not explored direct calculation of the ex- 
change rates, with the Subsequent replacement of comparison in the main 
method by comparison in the operator, to avoid expensive computation. 

We have not carried this example through m complete detail, nor have 
we established very many points on the path from a crude hill climber 
to SM. The two points determined are clearly appropriate ones and 
capture some of the important features of the method. They are not un- 
expected points, of course, since linear programming is well understood. 
The viewpoint underlying the analysis is essentially combinatorial, and 
such aspects have been thoroughly explored (e.g., see [23]). If these 
intermediate problems have any peculiar flavor, it is that they become 
established where the search spaces change, and these need not always 
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correspond to nice mathematical properties, abstractly considered. Thus 
convexity is not posited and its implications explored; rather a change 
of search space is posited and the problem statement that admits it sought. 

A single ancestral lineage should not be expected. Just as theorems 
can have many proofs, so methods can have many decompositions of 
their information. In fact, in one respect at least the line represented 
by M2 and M3 does violence to SM. It never recognizes the shift of 
problem into a set of equality constraints with the consequent change in 
dimensionality. Thus, the g’s and the x’s are handled separately, whereas 
it is a very distinct feature of the simplex procedure that it handles them 
uniformly. One could easily construct another line starting from SM, 
which would preserve this feature. (It would face a problem of making 
the transition to Ml.) 

The examples selected— linear programming and matrix inversion— 
are certainly ones that seem most amenable to the kind of analysis we 
have proposed. If we considered methods, say, for determining inventory 
levels, the story might be different. Nevertheless, perhaps the case, for 
continuity between weak and strong methods has been made plausible. 

5. HUMAN BEHAVIOR IN ILL-STRUCTURED PROBLEMS 

In the two issues discussed so far — the existence of weak methods, 
and the continuity between weak and strong methods we have not 
seemed to be dealing directly with ill-structured problems. To re-evoke 
the concern of Reitman, the problem statements that we have exhibited 
seem quite precise. (Indeed, we took pains to make them so and in a more 
technical exposition would have completely formalized them.) According 
to our hypotheses the world is always formalized, seen from the view- 
point of the methods available, which require quite definite properties 
to operate. A human problem solver, however, would not feel that a 
problem was well structured just because he was using a method on it. Our 
second hypothesis identifies this feeling with the low power of the appli- 
cable methods. 

The concern just' expressed is still well taken. If we examine some 
problem solvers who are working on “really ill-structured problems, 
what will we find? They will necessarily be human, since as noted 
earlier, men are the gatekeepers of this residual class of problems. Thus 
we cannot observe their problem-solving processes directly but must infer 
them from their behavior. 

To have something definite in mind consider the following problem 
solvers and tasks: 
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A financial adviser: what investments should be made in a new account? 

A foreman: is a given subordinate well adjusted to his work? 

A marketing executive: which of two competitors will dominate a 
given market to which his firm is considering entry? 

None of these problems is as ill structured as the proverbial injunctions 
to “know thyself” (asked of every man) and to “publish or perish” 
(asked of the academician). Still they are perhaps more typical of man- 
agement problems than these two almost completely open-ended problems. 
They do have the feature of most concern to Reitman; namely, neither 
the criteria for whether a solution is acceptable nor the data base upon 
which to feed are particularly well defined. 

The framework we have been using says that below the surface we 
should discover a set of methods operating. Our two hypotheses assert, 
first, that we should find general but weak methods; and, second, that 
we should not find methods that deal with the unstructured aspects 
(however they come to be defined) through any mechanism other than 
being general enough to apply to a situation with little definite informa- 
tion. 

Our first implication would seem to be upset if we discover that the 
human being has available very strong methods that are applicable to 
these ill-structured problems. This is a rather difficult proposition to 
test, since, without the methods themselves to scrutinize, we have very 
little basis for judging the nature of problem solving. Powerful methods 
imply good solutions, but if only men solve the problem, comparative 
quality is hard to judge. 

The three tasks in our- list have the virture that comparisons have 
been made between the solutions obtained by human effort and those 
obtained by some mechanical procedure. For the second two the actual 
tasks are close nonmanagement analogs of the tasks listed. However, 
they all have the property (implicit in our list) that the problem solver 
is a man who by profession is concerned with solving the stated type of 
problem. This condition is important in discussing real management 
problems, since the capabilities of a novice (e.g., a college student used 
as a subject in an experiment) may differ considerably from those of the 
professional. In particular, the novice may differ in the direction of using 
only very general reasoning abilities (since he is inexperienced), whereas 
the professional may have special methods. 

In all of the cases the result is the same. Rather simple mechanical 
procedures seem to do as well as the professional problem solver or even 
better, certainly they do not do worse. 
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The first task was investigated by Clarkson [1] in one of the early 
simulation studies. He actually constructed a program to simulate a 
trust investment officer in a bank. Thus the program and the human 
being attain the same level of solution. The program itself consists of a 
series of elementary evaluations as a data base, plus a recognition 
structure (called a discrimination net) to make contact between the 
specific situation and the evaluation; there are also several generate-and- 
tests. Thus the program does not have any special mechanisms for 
dealing with ill-structuredness. Indeed it deals with the task in a highly 
structured way, though with a rather large data base of information. 
The key point is that the human being, who can still be hypothesized 
to have special methods for ill-structured situations (since his internal 
structure is unknown) , does not show evidence of this capability through 

superior performance. _ 

The second task in its nonmanagement form is that of clinical judg- 
ment. It has been an active, substantial— and somewhat heated— concern 
in clinical psychology ever since the forties. In its original form, as re- 
viewed by Meehl [12], it concerned the use of statistical techniques 
versus the judgments of clinicians. With the development of the computer 
it has broadened to any programmed procedure. Many studies have been 
done to confront the two types of judgment in an environment sufficiently 
controlled and understood to reveal whether one or the other was better. 
The results are almost uniformly that the programmed procedures perform 
no worse (and often better) than the human judgment of the professional 
clinician, even when the clinician is allowed access 'to a larger “data base” 
in the form of his direct impressions of the patient. Needless to say, 
specific objections, both methodological and substantive, have been 
raised about various studies, so the conclusion is not quite as clear-cut 
as stated. Nevertheless, it is a fair assertion that no positive evidence of 
the existence of strong methods of unknown nature has emerged.* 

The third task is really an analog of an analog. Harris [7], in order 
to investigate the clinical versus statistical prediction problem just 
discussed, made use of an analog situation, which is an equally good 
analog to the marketing problem in our list. He tested whether formulas 
for predicting the outcome of college football games are better than human 
judgment. To get the best human judgments (i.c., professional) he made 
use of coaches of rival teams. Although there is a problem of bias, these 
coaches clearly have a wealth of information of as professional a nature 

7 A recent volume [10] contains some recent papers in this area, which provide 
access to tlie literature. 
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as the marketing manager has about the market for his goods. On the 
program side, Harris used some formulas whose predictions are published 
each week in the newspapers during the football season. An unfortunate 
aspect of the study is that these formulas are proprietary, although 
enough information is given about them to make the study meaningful. 
The result is the same: the coaches do slightly worse than the formulas. 

Having found no evidence for strong methods that deal with unstruc- 
tured problems, we might feel that our two hypotheses, are somewhat more 
strongly confirmed. However, unless the weak methods used by human 
beings bear some relationship to the ones we have enumerated, we should 
take little comfort. For our hypotheses take on substantial meaning 
only when the weak methods become explicit. There is less solid evidence 
on what methods people use than on the general absence of strong 
methods. Most studies simply compare performance, and do not attempt 
to characterize the methods used by the human problem solver. Likewise, 
many of the psychological studies on problems solving, although positive 
to our argument [14], employ artificial tasks that are not sufficiently 
ill structured to aid us here. The study by Clarkson just reviewed is an 
exception, since he did investigate closely the behavior of his investment 
officer. The evidence that this study provides is positive. ' 

. Numerou s studies in the management science literature might be 
winnowed either to support or refute assertions about the methods used 
. °rk in th e behavioral theory of the firm [2], for instance, provides a 
picture of the processes used in organizational decision making that is 
highly compatible with our list of weak methods — searching for alter- 
native^ changing levels of aspiration, etc. However, the characterizations 
are sufficiently abstract that a substantial issue remains whether they 
can be converted into methods that really do the decision making. Such 
descriptions abstract from task content. Now the methods that we 
have described also abstract from task content. But we know that these 
can be specialized to solve the problems they claim to solve. In empirical 
studies we do not know what other methods might have to be added to 
handle the actual detail of the management decision. 

Only rarely are studies performed, such as Clarkson’s, in which the 
problem is ill structured, but the analysis is carried out in detail. Reit- 
man has studied the composition of a fugue by a professional composer 
118], which is certainly ill structured enough. Some of the methods we 
have described, such as means-ends analysis, do show up there. Reitman’s 
characterization is still sufficiently incomplete, however, that no real 
evidence is provided on our Question. 
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6. DIFFICULTIES 

We have explored three areas in which some positive evidence can be 
adduced for the two hypotheses. We have an explicit set of weak met - 
ods- there is some chance that continuity can be established between 
the weak and the strong methods; and there is some evidence that human 
beings do not have strong methods of unknown nature for dealing with 
ill-structured problems. Now it is time to consider some difficulties with 
our hypotheses. There are several. 


6.1. The Many Parts of Problem Solving . 

At the beginning of this essay we noted that methods were only a 
part of problem solving, but nevertheless persisted in ignoring all the 
other parts. Let us now list some of them: 


Recognition 
Evaluation 
Representation 
Method identification 


Information acquisition 
Executive construction 
Method construction 
Representation construction 


A single concern applies to all of these items. Do the aspects of prob- 
lem solving that permit a problem solver to deal with ill-structured 
problems reside in one (or more) of these parts, rather than m the 
methods? If so, the discussions of this essay are largely beside the point. 

This shift could be due simply to the power (or generality) of a 
problem solver not being localized in the methods rather than to anything 
specific to ill-structuredness. The first two items on the list illustrate 
this possibility. Some problems are solved directly by recognition; for 
example, who is it that has just appeared before my eyes? In many 
problems we seem to get nowhere until we suddenly “just recognize 
the essential connection or form of the solution. Gestalt psychology has 
made this phenomenon of sudden restructuring central to its theory of 
problem solving. If it were true, our two hypotheses would certainly no.t 
be valid. Likewise for the second item, our methods say more about 
the organization of tests than about the tests themselves. Perhaps most 
of the power resides in sophisticated evaluations. This would work 
strongly against our hypotheses. In both examples it is possible, of course, 
that hypotheses of similar nature to ours apply. In the case of evalu- 
ations, for example, it might be that ill-structured problems could e 
handled only because the problem solver always had available some dis- 
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tinctions that applied to every situation, even though with less and less 
relevance. 

The third item on the list, representation of problems, also raises a 
question of the locus of power (rather than of specific mechanisms re- 
lated to ill-structured problems). At a global level we talk of the repre- 
sentation of a problem in a mathematical model, presumably a trans- 
ition from its representation in some other global form, such as natural 
language. These changes of the basic representational system are clearly 
of great importance to problem solving. It seems, however, that most 
problems, both well structured and ill structured, are solved without such 
shifts. Thus the discovery of particularly apt or powerful global repre- 
sentations does not lie at the heart of the handling of ill-structured prob- 
Jems. 

More to the point might be the possibility that only special represen- 
tations can hold ill-structured problems. Natural language or visual 
imagery might be candidates. To handle ill-structured problems is to be 
able to work in such a representation. There is no direct evidence to sup- 
port this, except the general observations that human beings have (all) 
such representations, and that we do not have good descriptions of them. 

More narrowly, we often talk about a change in representation of a 
problem, even when both representations are expressed in the same lan- 
guage or imagery. Thus we said that Fig. 10.12 contained two represen- 
tations of the LP problem, the original and the one for the simplex 
method. Such transformations of a problem occur frequently. For ex- 
ample to discuss the application of heuristic search to inverting matrices 
we had to recast the problem as one of getting from the matrix A to 7, 
rather than of getting from the initial data (A, I, AX = 1) to X. Only 
after this step was the application of the method possible. A suspicion 
arises that changes of representation at this level— symbolic manipulation 
into equivalent but more useful form— might constitute a substantial 
part of problem solving. Whether such manipulations play any special 
role in handling ill-structured problems is harder to see. In any event 
current research m artificial intelligence attempts to incorporate this type 
of problem solving simply as manipulations in another, more symbolic 
problem space. The spaces used by theorem provers, such as LT are 
relevant to handling such changes. 1 

Method identification, the next item, concerns how a problem state- 
ment of a method comes to be identified with a new problem, so that each 
of the terms m the problem statement has its appropriate referent in the 
problem as originally given. Clearly, some process performs this identifi- 
cation, and we know from casual experience that it often requires an 
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exercise of intellect. How difficult it is for the LP novice to “see” a new 
problem as an LP problem, and how easy for an old hand! 

Conceivably this identification process could play a critical role m 
dealing with ill-structured problems. Much of the structuring of a prob- 
lem takes place in creating the identification. Now it might be that 
methods still play the role assigned to them by our hypotheses, but even 
so it is not possible to instruct a computer to handle ill-structured prob- 
lems, because it cannot handle the identification properly. Faced with 
an appropriate environment, given the method and told that it was the 
applicable one, the computer still could not proceed to solve the problem. 
Thus, though our hypotheses would be correct, the attempt to give them 
substance by describing methods would be misplaced and futile.. 

Little information exists about the processes of identification in situ- 
ations relevant to this issue. When the situation is already formalized, 
matching is clearly appropriate. But we are concerned precisely with 
identification from a unformalized environment to the problem state- 
ment of a method. No substantial programs exist that perform such a 
task. Pattern recognition programs, although clearly designed to work 
in “natural” environments, have never been explored in an appropriately 
integrated situation. Perhaps the first significant clues will come out of 
the work, mentioned at the beginning of this chapter and still m its early 
stages, on how a machine can use a hand and eye in coordination. Al- 
though the problems that such a device faces seem far removed from 
management science problems, all the essentials of method identification 
are there in embryo. (Given that one has a method for picking up blocks, 
how does one identify how to apply this to the real world, seen through 
a moving television eye?) 

An additional speculation is possible. The problem of identification 
is to find a mapping of the elements in the original representation (say, 
external) into the new representation (dictated by the problem state- 
ment of the method to be applied). Hence there are methods for the 
solution to this, just as for any other problem. These methods will.be 
like those we have exhibited. (Note, however, that pattern recognition 
methods would be included.) The construction of functions in the. in- 
duction method may provide some clues about how this mapping might 
be found. As long as the ultimate set of contacts with the external repre- 
sentation (represented in these identification methods as generates and 
tests) were rather elementary, such a reduction would indeed answer the 
issue raised and leave our hypotheses relevant. 

An important aspect of problem solving is the acquisition of new 
information, the next item on the list. This occurs at almost every step, 
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of course, but most of the time it is directed at a highly specific goal; for 
instance, in method identification, which is a major occasion for assimi- 
lating information, acquisition is directed by the problem statement. In 
contrast, we are concerned here with the acquisition of information to be 
used at some later time in unforeseen ways. The process of education 
provides numerous examples of such accumulation. 

For an ill-structured problem one general strategy is to gather ad- 
ditional information, without asking much about its relevance until ob- 
tained and examined. Clearly, in the viewpoint adopted here, a problem 
may change from ill structured to well structured under such a strategy, 
if information is picked up that makes a strong method applicable. 

The difficulty posed for our hypotheses by information acquisition is 
not in assimilating it to our picture of methods. It is plausible to assume 
that there are methods for acquisition and even that some of them might 
be familiar, for example, browsing through a scientific journal as generate- 
and-test. The difficulty is that information acquisition could easily play 
a central role in handling ill-structured problems but that this depends 
on the specific content of its methods. If so, then without an explicit 
description of these methods our hypotheses cannot claim to be relevant. 
These methods might not formalize easily, so that ill-structured problems 
would remain solely the domain of human problem solvers. The schemes 
whereby information is stored away yet seems available almost instantly 
—as in the recognition of faces or odd relevant facts— are possibly aspects 
of acquisition methods that may be hard to explicate. 

The last three items on the list name things that can be constructed 
by a problem solver and that affect his subsequent problem-solving be- 
havior. Executive construction occurs because the gross shape of a par- 
ticular task may have to be reflected in the top-level structure of the 
procedure that solves it. The induction method, with the three separate 
induction tasks mentioned, provides an example. Each requires a separate 
executive structure, and we could not give a single unified procedure to 
handle them all. Yet each uses the same fundamental method. Relative 
to our hypotheses, construction seems only to provide additional loci for 
problem-solving power. This item could become important if it were 
shown that solutions are not obtained to ill-structured problems without 
some construction activity. 

The extended discussion of the parts of the problem-solving process 
other than methods, and the ways in which they might either refute or 
nullify our two hypotheses, stems from a conviction that the major 
weakness of these hypotheses is the substantial incompleteness of our 
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knowledge about problem solving. They have been created in res P°° s e 
to partial evidence, and it seems unlikely that they will emerge unscathed 
as some of these other parts become better known. 

6.2. Measures of Informational Demands 

Throughout the chapter we have talked as if adding information to a 
problem statement leads to a decrease in generality and an increase m 
power. Figure 10.3 is the baldest form of this assertion. At the most 
general level it seems plausible enough. Here one crudely identifies the 
number of conditions in the problem statement with the size of the space 
being searched: as it gets smaller, so the problem solver must grow more 
powerful. At a finer level of analysis, however, this assertion seems 
often violated, and in significant ways; for example, a linear program- 
ming problem is changed into an integer programming problem by the 
addition of the constraint that the variables { * } range over the positive 
integers rather than the positive reals. But this makes the problem 
harder, not easier. Of course, it may be that existing methods of integer 
programming are simply inefficient compared to what they could be. 
This position seems tenuous, at best. It is preferable, I think, to ta e as 
a major difficulty with these hypotheses that they are built on foundations 

of sand. 


6.3. Vague Information 

It is a major deficiency of these hypotheses (and of this chapter) that 
they do not come to grips directly with the nature of vague information. 
Typically, an ill-structured problem is full of vague information. This 
might almost be taken as a definition of such a problem, except that the 
term vague is itself vague. 

All extant- ideas for dealing with vagueness have one concept m com- 
mon: they locate the vagueness in the referent of a quite definite (hence 
un-vague) expression. To have a probability^ to have an lndefim e 
event, but a quite definite probability. To have a subset is to have a 
quite definite expression (the name or description of the subset) which 
is used to refer to an indefinite, or vague, element. Finally, the constructs 
of this chapter are similarly definite. The problem solver has a definite 
problem statement, and all the vagueness exists in the indefinite set ot 
problems that can be identified with the problem statement. 


•Reitman’s proposals, 
definite character [173. 


although we have not described them here, have the same 
So also does the proposal by Zadch for “fuzzy* sets [24]. 
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The difficulty with this picture is that, when a human problem solver 
has a problem he calls ill structured, he docs not seem to have definite 
expressions which refer to his vague information. Rather he has nothing 
definite at all. As an external observer we might form a definite expres- 
sion describing the range (or probability distribution) of information 
that the subject has, but this “meta” expression is not what the subject 
has that is this information. 

It seems to me that the notion of vague information is at the core 
of the feeling that ill-structured problems are essentially different from 
well-structured ones. Definite processes must deal with definite things, 
say, definite expressions. Vague information is not definite in any way. 
This chapter implies a position on vague information; namely, that there 
are quite definite expressions in the problem solver (his problem state- 
ment) . This is a far cry from a theory that explains the different varieties 
of vague information that a problem solver has. Without such expla- 
nations the question of what is an ill-structured problem will remain 
only half answered. 

7. CONCLUSION 

The items just discussed— other aspects of problem solving, the meas- 
urement of power and generality, and the concept of vagueness — do not 
exhaust the difficulties or deficiencies of the proposed hypotheses. But 
they are enough to indicate their highly tentative nature. Almost surely 
the two hypotheses will be substantially modified and qualified (probably 
even compromised) with additional knowledge. Even so, there are excel- 
lent reasons for putting them forth in bold form. 

The general nature of problems and of methods is no longer a quasi- 
philosophic enterprise, carried on in the relaxed interstices between the 
development of particular mathematical models and theorems. The de- 
velopment of the computer has initiated the study of information proc- 
essing, and these highly general schema that we call methods and problem- 
solving strategies are part of its proper object of study. The natureof 
generality in problem solving and of ill-structuredness in problems is 
also part of computer science, and little is known about either. The as- 
sertion of some definite hypotheses in crystallized form has the virtue 
of focusing on these topics as worthy of serious, technical concern. 

These two hypotheses (to the extent that they hold true) also have 
some general implications for the proper study of management science. 
They say that the field need not be viewed as a collection of isolated 
mathematical gems, whose application is an art and which is largely 
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excluded from the domain of “nonquantifiable aspects of management. 
Proper to management science is the creation of methods general enough 
to apply to the ill-structured problems of management — taking them on 
their own terms and dealing with them in all their vagueness and not 
demanding more in the way of data than the situations provide. To be 
sure, these methods will also be weak but not necessarily weaker than 
is inherent in the ill-structuring of the task. 

That management science should deal with the full range of manage- 
ment problems is by no means a new conclusion. In this respect these 
two hypotheses only reinforce some existing strands of research and ap- 
plication. They do, however, put special emphasis on the extent to which 
the hard mathematical core of management science should be involved 
in ill-structured problems. They say such involvement is possible. 
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Page 368 
Page 369 

Page 371 
Page 386 

Page 387 

Page 389 
Page 389 
Page 395 
Page 400 
Page 401 


Figure 10.1, second line under the figure should read 
"is not under the control of the inputting process." 

Paragraph 1.1, line 9 should read 

"a.., b., c., i = 1, •<•) tn; j = 1, •••> u" 
ij i J 

Third paragraph in the formula should read 
"Procedure: compute x = -b/2a + l/2a v b - 4ac." 

Third paragraph, fifth line should read 

"applied to elements in the problem space produce new 

elements. (Operators need not" 

Line 8 in formula should read 

'Wi ••• W ••• ) * x d" 

Line 10 change (m,x) to (m,t) 

Line 4 from the bottom should read 

fm t'l + x' 

"Apply; v - ---> match > construct > 

Paragraph 4.1., line 5 

change "applicablilty " to "applicability" 

Line 3 from the bottom 
change 'Varible" to 'Variable" 

Bottom line replace (;) with (:) 
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This paper is • survey of Artificial Intelligence (AX). It divides the field into four core topics 
(embodying the base for a science of intelligence) and eight applications topics (in which research has 
been contributing to core ideas). The paper discusses the history, the major landmarks, and some of 
the controversies in each of these twelve topics. Eaph topic Is represented by a chart citing the 
major references. These references are contained in an extensive bibliography. The paper concludes 
with a discussion of some of the criticisms of AI and with some predictions about the course of future 
research. 


1. INTRODUCTION 

Can we ever hope to understand the nature of Intelli- 
gence in the same sense that we understand, say, the 
nature of flight? will our understanding of intel- 
ligence ever be sufficient to help us build working 
models— machines that think and perceive— in the same 
way that our understanding of aerodynamics helps us 
build airplanes? Intelligence seems ao varied. We 
see it when a chemist discovers the structure of a 
complex molecule, when a computer plays chess, when 
a mathematician finds a proof, and even when a child 
walks home from school. Are there basic mechanisms 
or processes that are common to all, of these activi- 
ties and to all others commonly thought to require 
intelligence? 

The field of Artificial Intelligence (AI) has as its 
main tenet that there are indeed common processes 
that underlie thinking and perceiving, and further- 
more that these processes cAn be understood and 
studied scientifically. The processes themselves do 
not depend on whether the subject being thought about 
or perceived is chemistry, chess, mathematics, or 
childhood navigation. In addition, it is completely 
unimportant to the theory of AI who is doing the 
thinking or perceiving— man or computer. This is an 
implementations! detail. 

These are the emerging beliefs of a group of computer 
scientists claiming to be founding a new science of 
Intelligence. While attempting to discover and 
understand the basic mechanisms of intelligence, 
these researchers have produced working models in the 
form of computer programs capable of some rather im- 
pressive feats: playing competent chess, engaging 

in limited dialogs with humans in English, proving 
reasonably difficult mathematical theorems in set 
theory, analysis, and topology, guessing (correctly) 
the structure of complex organic molecules from mass- 
spectrogram data, assembling mechanical equipment 
with a robot hand, and proving the correctness of 
small computer programs. 

Whether the activities of these workers constitute a 
new scientific field or not, at the very least AI is 
a major campaign to produce some truly remarkable 
computer abilities. Like going to the moon or 
creating life, it is one of man's grandest enter- 
prises, As with all grand enterprises, it will have 
profound influences on nan's way of life and on the 


way in which he views himself. In this paper, I 
will try to describe the AI campaign, how it seems 
to be organized into subcampaigns, who is doing 
what, some of the current internal controversies, 
and the main achievements. There is the usual word 
of caution: I've made some rather large simplifica- 

tions in attempting to stand aside from the field 
and look at it with perspective. Not all workers 
would necessarily agree with what follows. 

Before beginning we must discuss an important char- 
acteristic of AI as a field, namely, that it does 
not long retain within it any of its successful ap- 
plications. Computer aides to mathematicians, such 
as differential equation solvers, that originated 
(at least partly) from AI research, ultimately be- 
come part of applied mathematics. A system, named 
DENDRAL, that hypothesizes chemical structures of 
organic molecules based on mas a- spectrogram data is 
slowly escaping its AI birthplace and will likely 
become one of the standard tools of chemists. This 
phenomenon is well-recognized by AI researchers and 
has led one of them to state that AI is known as the 
"no-win” field. It exports all of its winning ideas, 

On reflection, this la not surprising. When a field 
takes as its subject matter all of thinking , and 
then when particular brands of that thinking are 
applied to chemiatry, mathematics, physics, or what- 
ever, these applications become parts of chemistry, 
mathematics, physics, etc. When people think about 
chemistry, we call it part of chemistry— not an ap- 
plication of psychology. The more successful AI be- 
comes, the more its applications will become part of 
the application area. 

Destined apparently to lack an applied branch, la 
there a central core or basic science of AI that 
will continue to grow and contribute nweded Ideas to 
applications in other areas? I think the answer la 
yes. Just what form these central ideas will ulti- 
mately take is difficult to discern now. Will AI be 
something like biology— diverse but still united by 
the common structure of DNA? What will be the DNA 
of AI? 

Or will the science of AI be more like the whole of 
science Itself — united by little more than some 
vague general principles such as the scientific 
method? It is probably too early to tell. The 





present central Idea* see* aor* specific than does 
the scientific Method but less concrete than DMA. 

2. WHAT IS HAPPENING IN AI? 

2« 1 The structure of the field 

As s tactic in attempting to discover the basic 
principles of. Intelligence, AI researchers have set 
thea selves the preliminary goal of building computer 
programs that can perform various intellectual tasks 
that humans can perform. There are major projects 
currently under vay vhose goals are to understand 
natural language (both written and spoken), play 
master chess, prove non- trivial mathematical 
theorems, vrlte. computer programs, and so forth. 
These projects serve two purposes, first, they pro- 
vide the appropriate settings in which the basic 
mechanisms of Intelligence can be discovered and 
clfcrlfied. Second, they provide non-trivial oppor- 
tunities for the application and testing of such 
mechanisms that are already known. I am calling 
these projecti the first-level applications of AI. 

I have grouped these first-level applications (some- 
what arbitrarily) into eight topics shown spread 
along the periphery of Figure 1. These are the 
eight that I think have contributed the most to our 
basic understanding of intelligence. Each has 
strong ties to other <non-AI) fields, as well as to 
each other; the major external ties are indicated by 
arrows in Figure 1. 

Basil mechanisms of lntslllgence and implementa- 
tions! techniques that are common to several appli- 
cations, 1 call core topics. It seems to me that 
there are four major parts to this central core: 

• Techniques for modeling and representation of 
knowledge. 

• Techniques for common sense reasoning, deduction, 
snd problem solving. 

• Techniques for heuristic search. 

• AI systems and languages. 

These four parts are shown at the center of Figure 1. 
Again, we have indicated ties to other fields by 
arrows. It must be stressed that most AI research 
takes plsce in the first-level applications areas 
even though the primary goal may be to contribute 
to the more abstract core topics. 

If an application is particularly successful, it 
might be noticed by specialists in the application 
ares snd developed by them as s useful snd economi- 
cally viable product. Such applications we might 
call second- level applications to distinguish them 
from the first-level applications projects under- 
taken by the AI researchers themselves. Thus, when 
AI researchers work on s project to develop s proto- 
type system to understand speech, I call it s first- 
level application. If General Motora were to 
develop and inatall in their aasembly planta a aya- 
tem to Interpret television Images of automobile 
parts on s conveyor belt, I would call it a second- 
level application. (We should humbly note that per- 
haps several second- level applications will emerge 
elthout benefit of obvious AX parentage. In fact, 
these may contribute mightily to AI science Itself.) 

Thus, even though I agree that AI is a field that 
cannot retain its applications, it is the aeeond- 
level application* that It lacks. These belong to 


the applications areas themaelves. Until all of the 
principles of intelligence are uncovered, AI re- 
searchers will continue to search for them in various 
first-level applications areas. 

Figure 1, then, divides work in AI into twelve major 
topics. I have attempted to show the major papers, 
projects, and results in each of these topics in 
Charts 1 through 12, each containing references to an 
extensive bibliography at the end of this paper. 

These charts help organize the literature as eel l as 
Indicate something about the structure of work in the 
field. By arrows linking boxes within the charts we < 
attempt to indicate how work has built on (or has 
been provoked by) previous work. The lteas in the 
bibliography are coded to Indicate the subheading to 
which they belong. I think that the charts (taken as 
a whole) fairly represent the important work even 
though there may be many differences of opinion among 
workers about some of the entries* (and especially 
about how work has built on previous work). 

Obviously, a short paper cannot be exhaustive. But 
in this section I will summarize what is going on in 
AI research by discussing the major accomplishments 
and status of research in each of the twelve sub- 
headings. 

2. 2 The core topics 

Fundamentally, AI is the science of knowledge — how to 
repreaen t knowledge snd how to obtain and use knowl- 
edge. Our core topics deal with these fundamentals. 
The four topics are highly interdependent, snd the 
reader should be warned that it is probably wrong to 
attempt to think of them aeparately even though we 
are forced to write about them separately. 

2*2.1 Common-sense reasoning, deduction, and 
problem- aolving (Chart 1) 

By reasoning, etc., we mean the major processes in- 
volved In using knowledge: Using it to make infer- 

ences snd predictions, to make plans, to answer 
questions, and to obtain additional knowledge. As a 
core topic, we are concerned mainly with reasoning 
about evwryday , common domains (hence, common sens#) 
because such reasoning is fundamental, and we want 
also to avoid the possible trap of developing tech- 
niques applicable only to some specialized domain. 
Nevertheless, contributions to our ideas about the 
use of knowledge have come from all of the applica- 
tions areas. 

There have been three major themes evident in this 
cor * topic. We might label these puzzle-solving, 
question- answering, and comaon-sense reasoning. 

Puzzle-solving . Early work on reasoning concentrated 
on writing computer programs that could solve simple 
puzzles (tower of Hanoi, missionaries snd cannibals, 
logic problems, etc.). The Logic Theorist snd GPS 
(see Chart 1) are typical examples. From this work 
certain problem-solving concepts were developed snd 
clarified in an uncluttered atmosphere. Among these 
***** the concepts of heuristic search, problem spaces 
snd stataa, operators (that transformed one problem 
state into another), goal snd subgosl states, means- 
ends analysis, * snd reasoning backwards. The fact 
• 

la particular, some might reasonably claim machine 
vision (or more gsnerslly, perception) snd language 
understanding to be core topics. 
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FIGURE 1 MAJOR SUB-PARTS OF Al SHOWING TIES TO OTHER FIELDS 








































































































that that* uMful Ideas seem ao familiar In AZ re- 
search today taatlf lea to the success of this early 
work. But the very cleanness of pussies allowed re- 
searchers to avoid facing what has turned out to be 
the key problem, namely dealing with knowledge, huge 
amounts of knowledge, diverse, cluttered and inter- 
related. 

Question-answering . As one step toward facing the 
problem of dealing with knowledge, several research- 
ers concentrated on building Inferential question- 
answering systems. (See, in particular, the refer- 
ences listed under SIR, QA2, and QA3 In Chart 1.) 

Such systems should be able to store a large number 
of facta and should be able to respond to reasonable 
questions whose answers could be deduced from these 
facts. These systems required mechanisms for logical 
Inference and led AZ researchers Into a romance with 
logie In general and with Robinson's resolution prin- 
ciple in particular. (See Chart 7.) This line of 
research clarified our concepts of applying Inference 
techniques to enmm on- sense knowledge and led to var- 
ious useful schemes for associative retrieval of 
stored data. Ve also learned that for large 
questlon-anawerlng systems the question of when to 
use Inference methods was more important than the 
nature of the Inference mechanism Itself. Thus, we 
learned that we would need large amounts of secondary 
knowledge about how and when to use the primary 
knowledge of the domain. 

Common- sense reasoning . In 1939, McCarthy proposed 
an ADVICE- TAKER that would be able to accept knowl- 
edge and uae It to deduce answers to questions and 
to figure out simple plans for courses of action. 

One might ask such a system, for example, how to get 
to Timbuktu (a favorite example of McCarthy's). If 

the system knew about airline schedules, airports, 
bow to get to airports, and other common (but Im- 
mensely diverse) knowledge. It might answer thus: 

(1) go to your travel agent and find out about 
flights to Timbuktu, (2) using this information, 
aelect a flight and sake a reservation, (3) drive to 
the airport at the appropriate time, (4) park your 
car, and (3) get on the appropriate airplane. Each 
of these steps, of course, could be expanded in 
detail. 

Problems of this sort are clearly not as claan as 
puzzles; they demand the use of large amounts of 
knowledge; yet they have In common with puzzles the 
feature of planning a course of action to accomplish 

a goal. 

Robotics research (see Chart 9) has probably con- 
tributed the moat to our knowledge of how to generate 
plana based on large amounts of common-sense knowl- 
edge, Researcher! at MIT, using an arm In a domain 
of simple blocks (called the BLOCKS world) and at 
SRZ using a mobile robot in a domain of corridors 
and rooms, have developed verlous reasoning systems 
that can generate plana of action for a robot. Of 
these, we might mention In particular STRIPS, SHRDUJ, 
and HACKER (see Chart 1). 

There has been a lot of useful Internal controversy 
about how to build reasoning systems and about the 
beet directions for research. For a while, there 
was hope in some quarters that some universal sys- 
tem (baaed, for example, like QA3 on Robinson* a 
resolution principle) could be used for all of the 
tasks we have mentioned so far: puzzle-solving. 


quest Ion- answering, and common-sense reasoning. 

First attempts to build euch universal systems were 
unsuccessful In the Incorporation of the necessary 
domain-specific knowledge and techniques and, as far 
as I know, ther« are at present no serious advocates 
of a simple universal system. 

At the opposite extreme of this controversy, however, 
are the proponents of what Z would call ad hoc 1st . 

To them, following any systematic approach la ana- 
thema. Each task should simply be programmed on i ts 
own using whatever tricks might be needed. There is 
no doubt that this kind of opportunism la healthy for 
a growing field at 111 In search of its general prin- 
ciples. Still, the following point must be made 
against rampant ad hoclam: One part of developing a 

science is to discover those concepts thit are im- 
portant. Ve must try to produce intelligent behavior 
out of systems limited to various combinations of 
trial concepts. Our failures tell us whure our 
present concepts are weak and give us hints about new 
ones that might be needed. If our trial concepts are 
always allowed the crutch of ad hoc ism, we do not 
learn enough about where the concepts are weak. 

Another controversy concerns how much knowledge we 
ought to give our reasoning programs. At one ex- 
treme are researchers who insist that the program 
should be given only some basic premises from which 
It must derive any intermediate knowledge It needs to 
arrive at an answer. At the other (and Impossible) 
extreme, programs would be provided explicitly with 
answers to all problems. There are some who feel 
that derivation of answers ultimately will play such 
a large role In Intelligent systems that we may as 
well concentrate now on derivation techniques. To 
force derivation, they tend to work with knowledge- 
impoverished systems. 

The consensus Just now emerging from this controversy 
is that, becauss of combinatoric problems, an intel- 
ligent system probably will be able to make only 
reasonably direct derivations at any stage. Thus, 
to deal with a large domain, such a system must begin 
with a large skeletal network of basic knowledge 
about the domain and knowledge about how to use Its 
knowledge. Any sxcurslon from the known (explicitly 
represented) knowledge into the unknown (derived) can 
thus be well-guided (i.e., practical) even though the 
"volume" of the unknown part itself can be extremely 
large. It Is senselsss to Insist that, to answer s 
single question, an Intelligent system must repeat 
the tedious trial and error evolution of a large part 
of our cultural and scientific knowledge to say 
nothing of possibly having to repeat much of biologi- 
cal evolution Itself. Even the "let’s derive all" 
school would agree. What members of this school and 
some others did not realize was Just how much knowl- 
edge would finally be needed by intelligent systems. 
Given this realization, the only possible course Is 
to build "knowledge-based" programs.* 

2.2.2 Modeling and representation of knowledge 
(Chart 2) 

Our ideas about how to represent knowledge have come 
from several of the applications areas. (Quite 
• 

Minsky (1974) guesses that a knowledge-baaea system 
reasoning about vlauml Images (a system such as 
might be possessed by a typical human) "night need 
a few millions, but not billions, of structural 
units, interconnections, pointers." 
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obviously, every AI program uses some representa- 
tional scheme. Ve cite In Chart 2 just a fee of the 
Important contributions.) Researchers In machine 
vision and perception and in natural language under- 
standing were perhaps the first to realize how auch 
knowledge would be needed by high performance 
programs. These two applications areas have thus 
probably contributed the most to our repertoire of 
representational techniques. 

The systems mentioned in Chart 2 cover soae of the 
aajor suggestions. For example: 

Green (1969a, b,c): Statements in the first order 

predicate calculus. 

Qullllan (1968): Concept nodes in a graph structure 

linked by various relationships, 

Schank et al. (1972): Canonical concept structures 

having ’’slots” for case information. 

Hewitt (1969,71) and Wlnograd (1971): Pattern- 

invoked procedures plus assertions. 

Rulifson et al. (1971): Pattern-invoked procedures 

plus special list structures such as n-tuples, bags 
and sets with property lists all organized in a dis- 
crimination net. 

Newell (1967): Sets of productions organized as 

Harkov tables. 

Minsky (1974): Hierarchically organized structures 

called ’’frame systems.” These have "free variables” 
(analogous to Schank's slots) that can be matched 
against constants occurring in the data to be 
analyzed. 

For a period there was some controversy over whether 
knowledge should be represented assertlonally or pro- 
cedurally. (As an extreme case, a spiral, say, can 
be represented assertlonally by a list of the points 
in the plane through which it passes, or it can be 
represented procedurally by a program that draws it.) 
Something of a cult was made of the ’’procedural em- 
bedding” of knowledge, but this controversy seems to 
be settling down now to an acceptance of the value 
of a combination of aaaertional and procedural 
knowledge. 

Another concern, having antecedents in logic, is how 
to represent certain "modal” concepts involving time, 
necessity, possibility, and so forth, McCarthy & 
Hayes (1969) have analyzed some of the difficulties 
in formalizing the*# concepts; meanwhile, Hendrix 
(1973) and Bruce (1972) have developed systems that 
begin to deal with some of them. 

McCarthy and Hayes (1969) also discuss two funda- 
mental problems concerning representation and 
reasoning. One is called the frame problem , and it 
concerns certain difficulties of model maintenance. 

If have a representation of the world at a cer- 
tain instant (based on observations and a priori 
knowledge), how should we represent and use "laws 
of physics'* to update the model so that it repre- 
sents the world (reasonably accurately) at some fu- 
ture Instant? If a robot removes a book from a 
shelf, can we assume that a door across the room 
remains open without having to derive this fact or 
observe it again? There are several ways of dealing 
with this problem, e.g. , Green (1969), Flkes and 
Nilsson (1971), Sandewall (1972), and Hewitt (1969). 
These are nicely discussed by Hayes (1973), 

Another problem is the qualification problem . If 
a system uses its representation to "prove," say, 
th^t a certain plan will achieve a desired gosl 


(the gosl of being at the airport), how are we to 
deal with. certain difficulties arising when new in- 
formation Is received prior to executing the plan. 
Suppose, for example, someone tells us that our auto- 
mobile is out of gasoline so that now our plan (that 
called for driving to the airport) eill not work. Ve 
had proved that it would, and now new information has 
rendered the proof Invalid even though’ all of the 
information on which the original proof was baaed ia 
•till present. Kayea (1973) discusses this violation 
of the "extension property" and shove the close con- 
nection between the qualification problem and the 
frame problem. System builders (e.g., Hewitt (1969) 
and Rullfaon et al. (1972)] have Invented certain 
constructs that apparently get around these difficul- 
ties, although in a way that ia somewhat unsatis- 
factory to logicians. 

Ve are still quite a way. It seems, from hiving a 
sound theoretical basis for knowledge representation. 
It la ay view that the necessity of developing larga 
and complex reaaoning systems will produce the new 
concepts out of which the needed theories will be 
constructed. 

2.2.3 Heuristic search (Chart 3) 

One of the first results of early AI research was the 
development of a point of view toward problem-solving 
sometimes called "the heuristic search paradigm.” 
There are two cloaely related versions of this 
paradigm. In one, a "problem” la transformed into 
the canonical problem of finding a path through a 
"space” of problem states from the initial state to 
a goal (i.e., solution) state. In the other, a prob- 
lem la "reduced" to various subproblems that are also 
reduced In turn (and so on) until the ultimately re- 
sulting subproblems have trivial or known solutions. 
Each version ia merely a slightly different way of 
thinking about basically the same problem-solving 
process. In each, the process involves generating 
alternative paths toward solutions, setting up cer- 
tain key milestone states (or subproblems), and 
managing search resources wisely to find acceptable 
solutions. 

The word "heuristic" is used because these techniques 
emphasize the use of special knowledge from the prob- 
lem domain that "aids in discovering a solution" by 
drastically reducing the amount of starch that would 
otherwise have to be employed. Often this knowledge 
takes the form of "rules-of- thumb" that help to limit 
or direct the search. Sometimes there are constrain- 
ing relations that can be employed to limit the 
search needed. (A good example of the use of con- 
straints Is the work of Valtz (1972).] 

I have already referred to aome of the heuristic 
search paradigm ideas (subgoals, reasoning backwards, 
and so on) as being basic to common-sense reasoning, 
deduction, and problem solving (Chart 1). Here (in 
Chart 3), we want to cite mainly those aspects of 
heuristic search dealing with the search process it- 
self. Once a problem is represented as a search 
problem, how can a solution be found efficiently? 

The searching occurs in one of two graph structures, 
ordinary graphs (or trees), and AND-OR graphs (or 
trees), depending on whether the problem la viewed 
as one of finding a path to a goal atata or one of 
reducing problems to subproblems, respectively. The 
search technlquas that have been developed (by 
workers in AI, control theory, and ooeratlons 
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research) »r« now co mm only uMd in many AI programs 
and in uny of their applications. Moat of these 
techniques make uao of beurlstlcally-based evaluation 
functions that rank-order the unexplored nodos in the 
graph and thus Indicate where search can aoet effl- 
clently proceed. Furthermore, there arc sou 
theorems (Hart at al. (1968)] a tat In* conditions 
under which these search netbods are guaranteed to 
find optimal paths. The problem of efficiently 
searching a graph has essentially been solved and 
thus no longer occupies Al researchers. This one 
core area, at least, seems to be well under control. 

2.2,4 AI systems and languages (Chart 4) 

The programming languages developed and used by AI 
researchers are Included among the core topics be- 
cause they embody the most useful of the core ideas 
already discussed. Early AI researchers saw the need 
for programs that could store, access, and manipulate 
lists of symbolic information. The means for achiev- 
ing these and other operations were built Into vari- 
ous list processing languages, primarily IPL-V and 
LISP. 

After some years of research using these languages, 
it became apparent that AX systems had a common, re- 
curring need for operations such as- search, 
expression-retrieval, and pattern-matching. The 
next step was to build these operations Into the 
languages themselves. Thus, In the late 1960s, 
another generation of AX languages emerged, languages 
such as QA4 and PLAINER. 

Edward Felgenbmum once characterized progress In AI 
research as progress along the "what-to-how” spectrum 
of computer languages. At the ”ho»" end of this 
spectrum sre the machine languages used by programmers 
who must give the most detailed instructions to the 
computer. As one progresses toward the "what” end, 
the programmer leaves more and more of the details 
of how operations are to be csrrled out to the 
language and can be more and more concerned only with 
what is to be done. AI languages are now moderately 
far along toward the ”what” end, and the proper goal 
of Al research (according to this view) Is to create 
languages even closer to the "what" end. It may well 
be that, ultimately, the field of Al will in large 
part be concerned with the development of superpower- 
ful computing languages. In this light, the best 
way to measure AI progress is to look st the AI 
languages. 

We do not have space here to trace the development 
of AI languages nor to describe the special features 
that they make available to AI researchers. For- 
tunately, there is an excellent tutorial paper by 
Bo brow and Raphael (1973) that gives a very clear 
account of the new languages. 

Currently, a large part of AI research is being con- 
ducted by experimenting with systems written In the 
new languages. The languages provide especially 
powerful mechanisms for representing the extensive 
knowledge needed by present programs. Furthermore, 
this knowledge can now be easily added incrementally 
as the program evolves under the tutelage of human 
experts in the domain. Wlnograd'a (1971) natural 
language understanding system and Valdlnger and 
Levltt*s (1974) system for proving assertions about 
prograsw are good examples of bow the power of these 
languages Is being used. 


It would not be unreasonable to expect that current 
and future experimentation will lead to the crystal- 
lization of additional concepts [such as. perhaps, 
Minsky's (1974) Frame Systems) that will be Incor- 
porated In a new round of Al languages, possibly In 
the late 1970s. 

2,3 First-level applications topics 

2.3.1 Came playing (Chart 3) 

Programs have been written that can play several 
games that humans find difficult. As the most famous 
txampls, we might mention the chess playing program, 
MAC- HACK, of Greeoblatt et al. (1967). A version of 
this program achieved a United States Chess Federa- 
tion rating of 1720 in one tournament. Samuel's 
programs for checkers have beaten experts in the 
game. Several other programs are mentioned in the 
chart. 

Levy (1970) described a program written by Atkins, 
Slate, and Cor land at Northwestern University and 
said that he thought it was stronger than 
Greenblmtt's. He estimated Its rating at about 1730, 
which would make it, he claims, the 500th best player 
In Britain. 

Computer chess tournaments are now held routinely. 
Results of these and other news about computer cheas 
have been rather extensively reported in the SIGART 
Newsletter since 1972. 

Most game playing programs still use rather straight- 
forward tree- searching ideas and are weak in their 
use of high-level strategic concepts. It is gener- 
ally agreed that advancws In thw uaw of strategy and 
In end-game play axe necessary before chess progrsws 
can Become substantially better, and they must be- 
come substantially better before they can beat human 
champions, (World Champion Bobby Fischer la rated 
at ebout 2810.) Levy <1970) la rather pessimistic 
about the rate of future progress in chess and has 
made a £730 bet with Professors McCarthy, Papert, 
and Mlchle that a program cannot beat him in a match 
by August 1978. (Levy's rating In 1970 was 2380.) 

2.3.2 Math, science, and engineering aids (Chart 6) 

The chart lists Just a few examples of AI techniques 
that have been applied in systems that help human . 
professionals. The early AI work on symbolic inte- 
gratlon, together with the work on slgebrsic simpli- 
fication, contributed to a number of systems for 
symbolic mathematical computations. Moses (1971b) 
presents a good review. Systems presently exist 
that can solve symbolically an equation like 
y 2 * - 3y* + 2 ■ 0 (for x), and that can integrate 
symbolically an expression like J(x + e x ) dx. Such 
systems are quite usefully employed in physics re- 
search, for example, In which expressions arise 
having hundreds of terms. 

Another quite successful application is the DENDRAL 
program that hypothesizes chemical structures from 
s combination of mass spectrogram and nuclear mag- 
netic resonance data. The system is presented with 
this data from’a sample of a known chemical compound 
(that IS, Its chemical formula is known). It uses 
several levels of knowledge about chemical struc- 
tures and how they break up In mass spectroscopy to 
infer the structure of the compound. Zt can deal witt. 
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a large number of organic com pound a Including coop lax 
ulnta and estrogenic a taro Ids. Its performance on 
tha s taro Ids of tan axcaads thabast human parfonanca. 

Tbs DENDRAL projact typifies a style of AI systsn 
building that has baan quits successfully applied to 
ebenlstry and some other domains. This design styla 
Involves Intensive Interaction between AI scientists 
and applications area scientists. Tha latter are 
queried In tha nlnutest detail to extract Iron then 
rules and other knowledge that are operationally use- 
ful In the domain. These are then coded Into the 
system by the AI scientists and testa are run to 
Judge their effectiveness. The process Is long and 
involves several Iterations. The applications scien- 
tists are often confronted with apparent contradic- 
tions between how they say they make decisions and 
how they actually make decisions. Few of them have 
any really global or completely accurate theory of 
how they apply their knowledge. Furthermore, this 
knowledge Is often informal and heuristic. As a re- 
sult, the emerging system is a collection of ’'mini- 
theories 1 * and special rules of only local effective- 
ness. To use this design strategy, the system must 
be one that can deal with many, and sometimes con- 
flicting, slnl-theorlea .- It must also be a system to 
which new knowledge can gradually be added and old 
knowledge modified. 

After several months or years of this sort of gradual 
shaping of the system, it comes to simulate the per- 
formance of the human experts whose knowledge It has 
gained. This general strategy Is beginning to be 
employed extensively In AI applications, [ror ex- 
ample, see also Shortllffe et al. (1973).] 

2.3.3 Automatic theorem proving (Chart 7) 

There are three major themes evident in attempts to 
get computer programs to prove theorems In mathe- 
matics and logic. First, early work by AI research- 
ers produced heuristic programs that could prove 
simple theorems in propositional logic and high- 
school level theorems In plane geometry. These pro- 
grams used (but mainly helped to refine) concepts 
like reasoning backwards, means-ends analysis, use 
of subgoals, and the use of a model to eliminate 
futile search paths. The fact that .logicians had 
already developed powerful procedures that effec- 
tively eliminated propositional logic as a domain 
requiring heuristic problem-solving techniques does 
not detract from the value of this early work. 

Logicians were also developing techniques for prov- 
ing theorems in the first order predicate calculus. 

J, A. Robinson (1965) synthesized some of this work 
into a procedure for using a single rule of infer- 
ence, resolution , that could easily be mechanized in 
computer programs. Building resolution-based proven 
quickly became a second theme in automatic theorem 
proving, while other approaches languished. Resolu- 
tion had a great influence on other sppllcatlon areas 
as well (Charts 1 and 8). Performance of the reso- 
lution systems reached impressive, if not superhuman, 
levels. Programs were written that could prove rea- 
sonably complex, sometimes novel, theorems in certain 
domains of mathematics. The best performance, how- 
ever, was achieved by man-machine systems in which a 
•killed human provided strategic guidance leaving the 
system to verify lemmas and to fill In short chains 
of deduction. [See •specially Guard et al. (1969) 
and Allen and Luckhan (1970). The latter system 
ham 1 been used to obtain proofs of new math«matlcal 


results announced without proof In the Notices of the 
American Mathematical Society . ] 

Various strategies were developed to Improve the 
efficiency of the resolution pro vers. These strate- 
gies were mainly based on the form or syntax of the 
expressions to be proved and not on any special knowl- 
edge or semantics of the domain. In automatic theorem 
proving. Just as in other applications areas, semantic 
knowledge was needed to Improve performance beyond the 
plateau reached by the late 1960s. 

The work of Bledsoe and hla students Is typical of 
the third and latest theme in automatic theorem prov- 
ing. Although they emphasize the Importance of man- 
machine systems, their programs themselves hsve’ become 
knowledge-based specialists In certain mathematical 
domains. The use of semantic knowledge in theorem- 
proving systems has also renewed Interest in heuris- 
tics for subgoallng, end so forth. The programs of 
this group are capable of proving some rather Impres- 
sive theorems , and it can be expected that the present 
man-machine systems will produce ever more competent 
and more completely automatic offspring. 

2.3.4 Automatic programming (Chart 8) 

Work in automatic programming has two closely inter- 
related goals. One is to be able to prove that a 
given program acts in a given way; the other is to 
synthesize a program that (provably) will act In a 
given way. The first might be called program veri- 
fication and the second program generation. Work on 
one goal usually contributes to progress toward the 
other; hence, we combine them in our discussion. 

Most of the work on program verification is based on 
a technique proposed by Floyd (1967). (See also Turing 
(1949).] This technique involves associating asser- 
tions with various points in the flow chart of a pro- 
gram and then proving these assertions. Originally, 
the assertions had to be provided by a human, but some 
recent work haa been devoted to generating the asser- 
tlons automatically,. Once proposed, one can attempt to 
have the assertions proved either by a human or by a 
machine. The latter course involves a cloae link be- 
tween this field and that of automatic theorem proving. 

A recent system developed at the Stanford Research 
Institute [Elspss et al. (1973)] is typical of one in 
which the assertions are both produced [Elspas (1972)] 
and proved [waldinger and Levitt (1973)] automatically. 
This system has been used to verify several programs 
including a real-number division algorithm and some 
sort programs. It has also proved theorems about a 
pattern matcher and a version of Robinson's (1963) 
unification algorithm. It la a good example of a 
modern AI program in that it makes effective use of a 
large amount of domain-specific knowledge. 

The closely related work on program generation haa 
succeeded in producing some simple programs. Typical 
of this work is the system of Buchanan and Luckham 
(1974). Broadly viewed, the problem of constructing 
a computer program includes the problem of construct- 
ing a plan, say, for a robot, and thus there are cloae 
links between work in automatic programming, robotics, 
and common- sense reasoning and deduction. 

Suasman'f (1973) HACKER is another system that writes 
simple programs for a limited domain (the BLOCKS 
world). Sussman's goal for HACKER is for it to simu- 
late his own programming style. An important feature 
of HACKER is its strategy of attempting first to 
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writ* a simple "let ' s-hope-that-thia-wlll-do" program, 
and than debugging It until it does succeed at ita 
task. To employ thia strategy , HACKER use* a great 
deal of knowledge about likely clasaoa of program 
bugs and bow to fix than. 

Again, aoaa of tha no at auccaaaful work baa baan In 
connactlon with man-machine systems, We lncluda in 
thla catagory cartaln alda to hunan programmers auch 
as thosa found in tha IMTERLISP ayatan [Teltelman 
(1972a, b, 1973)] . In fact, any tachnlquaa that halp 
naka tha production of prograna sort efficient night 
ba cal lad part of autonatlc programming. Balzar 
(1972) provides a good sunnary of thla broad vlaw of 
tha f laid. 

2.3.9 Robota (Chart 9) 

Every now and than, nan gathers up whatever tech- 
nology happens to ba around and attempts to build 
robots. During tha lata 1960a, research on robots 
provided a central focus for integrating nuch of the 
AX technology. To build an Intelligent robot is to 
build a nodal of nan. Such a robot should have gen- 
eral reasoning ability, locomotive and manipulative 
skills, perceptual (especially visual) abilities, and 
facility with natural language. Thus, robot research 
is closely linked with several other applications 
areas. Xn fact. Most of the research on machine 
vision (Chart 10) was, and Is, being perforated In 
connection with robot projects. 

Our problem-solving and representational techniques 
are probably already adequate to allow useful general 
purpose robot applications; however, such robots 
would ba perceptually impoverished until we develop 
much More powerful visual abilities. Robotics Is s 
particularly good domain in which to pursue the nec- 
essary vision research. 

The robot research of the late 1960s produced systems 
capable of forming and then intelligently executing 
plans of action based on an internal model of the 
world. The Edinburgh, Stanford, HXTAC, and MIT sys- 
tems consisted of manipulator arms and TV cameras 
or other visual Input devices. These became capable 
of building structures out of simple blocks. Xn one 
esse (Stanford), the system could assemble an auto- 
Mobile watsr pump. The Stanford Research Institute 
system consisted of s mobile cart and TV camera (but 
no arm). It could form and execute plans for navi- 
gating through a simple environment of rooms, door- 
ways, and large blocks, and its visual ayatem could 
recognize and locate doorways, floor-wall boundaries, 
and the large blocks. The system hsd sophisticated 
techniques to allow it to recover from errors and un- 
foreseen circumstances, and it could ators (learn) 
generalized versions of the plans It produced for 
future use. 

Since practical applications of gsnerml purpose robot 
systems seem more remote than they do in other appli- 
cations areas, the increasingly pragmatic research 
climate of the early 1970s has seen a lessening of 
activity in general robotics research. In the mean- 
time, various projects with the prscticsl goal of 
advancing industrial automation have begun to apply 
some of the already-developed manipulative and visual 
skills to factory assembly and Inspection problems. 

It seems reasonable to predict that man f s historic 
fascination with robots, coupled with a new round of 
advances in vision and reasoning abilities, will 


lead to a resurgence of Interest In general robot 
systems, perhaps during the .late 1970a. 

2.3.6 Machine vision (Chart 10) 

The ability to interpret visual images of the world 
la adequate enough even In some insects to guide many 
complex behavior patterns. Yet the analysis of 
everyday visual scenes by machine still remains a 
largely unconquered challenge to AI researchers. 

Early work concentrated almost exclusively on design- 
ing systems that could classify two-dimensional 
images into a small number of categories— alpha- 
numeric character recognition, for example. Xn 
fact, much of the AI work during the 1990a was con- 
cerned with pattern recognition. Researchers, such 
as Frank Rosenblatt and Oliver Self ridge, were influ- 
ential la shaping* thla early period. Pattern classi- 
fication (or recognition) continues as a separata 
active research interest, but since about 1963, 

AI interest in vision has centered on the more dif- 
ficult problem of interpreting and describing complex 
three-dimensional scenes. Both aspects, classifica- 
tion and description, are thoroughly and clearly 
treated in an excellent textbook by Duda and Hart 
(1973). 

Much of the scene analysis work can be traced to 
Robert's (1963) influential thesis. It established 
a trend of analyzing scenes composed of prisemtlc 
solids (the so-called "blocks world"). Working with 
these (sometimes complex) scenes composed of simple 
objects helped to establish a wide range of tech- 
niques for converting raw video Images into symbolic 
* descriptions based on concepts such as lines, regions, 
and simple shapes. The MIT "COPY" system, for ex- 
ample , can use a visual Input device to look at a 
scene consisting of a structure of blocks. The sys- 
tem can analyze the scene to form e representation 
of how the blocks are arranged. This representation 
can then later be used (with the robot arm system) to 
reproduce thia exact block structure from disarranged 
blocks. 

Some successful excursions outside the blocks world 
have been made. (See the entries to the right of 
the dashed line in Chart 10). Indeed, many re- 
searchers contend that continuing to work with 
blocks has actually hindered research progress in 
machine vision because it has allowed workers to 
avoid facing certain key problems associated with 
domain semantics, distinguishing features of complex 
objects, and new representational schemes. In any 
case, working with more complex scenes is now well 
established. The spirit of recent work is well de- 
scribed in a note by Tenenbaum (1973). Again, 
knowledge about the domain is crucial to scene inter- 
pretation! 

2,3.7 Natural language systems (Chart 11) 

Processing written and spoken forms of natural 
languages has been the subject of so much research 
that the task of surveying progress in this field 
alone should humble anyone. How, then, are we to 
say anything very helpful tn Juat a few sentences. 

The interested reader should see the GENERAL, volumes 
cited in Chart 11. 

Perhaps a description of the power of some repre- 
sentative systems will suffice. Winograd (1971) 
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Implemented a system that carried on "blocks -wo rid" 
written dialogs like the following: 

Men (typing into e terminal): Which cube la alt ting 

on the table? 

8ystem (generating text on a display): The large 

green one which supports the red pyramid. 

N: Is there a large block behind a pyramid? 

5: Yes, three of them: A large red one, a large 

green cube, and the blue one. 

M: Put a small one onto the green cube which sup- 

ports a pyramid. 

S: OK 

M: Put the llttlest pyramid on top of it. 

S: OK • 

The system demonstrates les understanding of the last 
two commands by having a simulated robot arm carry 
out appropriate actions in a simulated blocks world. 

The work of Schank (1972) typifies a rather success- 
ful trend in natural language understanding. Many of 
the recent systems, in one way or another, attempt to 
match a section of input text or utterance against 
semantically likely stored structures (that are more 
or less complex.) These structures are themselves 
schemas or scenario families having variables that 
are bound to constants In the Input during matching. 
The instantiated scenarios serve as a sort of deep 
structure that represent the meaning of the utter- 
ance. [See also Minsky (1974).] 

The goals of a coordinated scientific effort to pro- 
duce systems to understand limited utterances of 
continuous speech are clearly outlined in s plan by 
Newell et si. (1971). If the goals are met, by 1976 
a prototype system should be able (in the context of 
a limited domain of discourse) to understand (in a 
few times real time) an American (whose dialect is 
not extremely regional) speaking (in a "natural” 
manner) ordinary (although perhaps somewhat simple) 
English sentences constructed from a 1000-word vocab- 
ulary, These projects bring together workers in 
acoustics and speech research as well as in At. The 
projects seem to be more or less on schedule and will 
probably achieve creditable performance by 1976, (In 
the spirit of the vagueness of the phrase "a few 
times real time,” the projects ought to achieve the 
1976 goals at least sometime In the late 1970s.) 

In my opinion, the work in natural language under- 
standing is extremely Important both for Its obvious 
applications and for its future potential contribu- 
tions to the core topics of Al. It is the prime ex- 
ample of a field in which reasonable performance 
could not be achieved by know ledge- impoverished sys- 
tems. We now know that understanders need large 
amounts of knowledge; the challenge is to attempt to 
build some really large systems that have the ade- 
quate knowledge snd to learn, by our mistakes, the 
organizational principles needed to keep these large 
systems from becoming unwieldy. 

2.2.8 Information processing psychology (Chart 12) 

Computer science in general and Al xn particular have 
had a tremendous impact on psychology. They pro- 
vide the concepts and the very vocabulary out of 
which to construct the moat useful theories of human 
behavior. In my opinion the reason that, say, prior 
to 1933, there ’■ere, In fact, no adequate theories 
of human behavi , perception, and cognition is 


because the concepts out of which to construct these 
theories had not yet been formulated. Before we have 
the concepts (and they are now gradually accumulating) 
It Is as impossible to understand human thought as it 
was laposslbls to understand navigation, say, before 
we had the concept of sonar. Man understands the 
world by constructing models, snd his mods Is are often 
baaed on concepts drawn from his technological Inven- 
tions. We may not understand man Immediately after 
building the first robot, but we certainly won't un- 
derstand him before.' (We note in passing that knowl- 
edge about the structure snd function of the neuron — 
or any other basic component of the brain— is irrele- 
vant to the kind of understanding of intelligence that 
we are seeking. So long as these components can per- 
form some very simple logical operations, then It 
doesn't really nattar whether they are neurons, re- 
lays, vacuum-tubes, translators, or whatever.) 

An excellent short account of tbs relationship between 
AX and psychology has been written by Hewell (1970). 
While he, perhaps prudantly, adopts s somewhat less 
extreme position than mine about the dependence of 
psychology on Al, he nevertheless shows how thor- 
oughly information processing ideas have penetrated 
psychological theory. 

Moat of the informatlon-procesaing-based psychology 
to date has been devoted to explaining either memory 
(e.g., EPAM and BAM in Chart 12), perception [e.g*, 
Sternberg (1966)], or problem solving [e.g., Newell 
and Simon (1972)]. Probably the moat complete 
attempt at understanding human problem-solving abil- 
ity la the last-mentioned work of Newell and Simon. 
This volume proposes sn information processing theory 
of problem-solving based on the results of many years 
of research in psychology and Al. 

Animal behavior, while long the special lntersst of 
experimental psychologists, has had little 
informatlon-procesaing-based theoretical attention. 
Some models inspired by ethologists have been pro- 
posed by Friedman (1967). I think that the produc- 
tion system model advanced to explain certain human 
problem solving behavior by Newell (1967) and col- 
leagues might be a starting point for an extensive 
theory of animal behavior. Newell, himself, notes 
that these production systems can be viewed as gen- 
eralizations of stimulus -response systems, [inciden- 
tally, the entire repertoire of what was callsd 
"Intermediate-level actions" of the Stanford Research 
Institute robot system (Raphael et al • 1971) was 
Independently programmed In almost exactly this pro- 
duction formalism. Production systems have been used 
In other AX programs as well.] Newell and Simon 
(1972, p. 803) have also stated that they "have a 
strong premonition that the actual organization of 
human problem solving 'programs closely resembles the 
production system organization ,...** It would teem 
profitable then to attempt to trace the evolutionary 
development of this hypothesized production system 
organization down through some of the higher animals 
st least. 

3, CONCLUSIONS 

In summary, we see that the AX campaign la being 
waged on several different fronts, and that the vic- 
tories, ss well* ss the setbacks, contribute to s 
growing common core of ideas that aspires to be a 
science of Intelligence. Against this background. 



It is worth mentioning some of th# popular critl- 
diaa of AI : 

(1) AI hasn't really don* anythin* y*t. Thar* ar* 
a f*w "toy" programs that play middling chess and 
solve simple puzzles Ilk* "missionaries and canni- 
bals but th* actual accomplishments of AI measured 
against Its proalsas ar* disappointing. [See, for 
example, Drayfus (1963 1 1972).] [My comtnt about 
this kind of criticism Is that Its authors havan't 
ra ally lookad at AI rasaarch past about I960.] 

(2) Not only has AI not achlavad anythin* , but Its 
goals ar* actually lmposslbl*. Thus. AI Is somethin* 
Ilka alchaay. It Is lmposslbl* In principle to pro- 
gram into computers such necessities of Intelligence 
as* N fringe consciousness*' and "perspicuous grouping." 
[Again, see Dreyfus (1963, 1972).] [This kind of 
criticism Is actually rather brave In view of the 
fat* of many previous Impossibility predictions. 

This attack simply looks Ilk* a poor b*t to m*.] 

(3) Th* subject matter of AI , namely intelligence. 

Is too broad. ' It's Ilk* claiming science Is a field. 
[This criticism nay have son* merit.] 

(4) Everything happening In AI could Just as well 
happen In other parts of computer sclenc*, control 
engineering, and psychology. There is really no need 
for this AI "bridge" between already established dis- 
ciplines. [See Llghthlll (1973).] [This kind of 
criticism caused quite a stir in Great Britain re- 
cently. Z think I have shown that th* so-called 
bridge has quite a bit of Internal structure and is 
contributing a heavy traffic of Ideas Into Its 
termini. ] 

(3) AX Is Impossible because it Is attempting to re- 
duc* (to understanding) something fundamentally "ir- 
reducible." Furthermore, this very attempt is pro- 
fane; there ar* certain awesome mysteries In life 
that best remain mysterious. [See Roszak (1972).] 

[My prejudice about this view Is that, at best. It 
Is, of course, nonsense. A blind refusal even to 
attempt to understand la patently dangerous. By all 
means, let us not' foreclose a "rhapsodic understand- 
ing" of these mysteries, but let ue also really 
understand them.] 

(6) AZ la too dangerous, ao It probably ought to be 
abandoned— or at least severely limited, [see 
Veizcnbaum (1972).] [My view la that the potential 
danger of AI , along with all other dangers that man 
presents to himself, will survive at least until we 
have a science that really understands human emotions. 
Understanding these emotions, no less than under- 
standing intelligence and perception, will be an 
ultimate consequence of AI research. Not to under- 
stand them Is to be at their mercy forever, anyway.] 

The one criticism having any weight at all, Z think, 

Is that AI may be too broad and divers* to remain s 
cohesive field. So far, It has stayed together rea- 
sonably well. Whether It begins to fractionate into 
separate exotic applications areas of computer sci- 
ence depends largely, I think, on whether these ap- 
plications continue to contribute cor* Ideas of great 
generality. 

What la th* status of these core Ideas today? There 
ar* two extreme views. 1 have heard John McCarthy 
say (perhaps only provocatively to students) that 
really intelligent programs are a long way off and 
that when we finally achieve them they will be based 
on Ideas that aren't around yet. Their builders 
will look back at AI In 1974 as being s period of 
pre-history of th* fl*ld. 


On th* other hand, what If w* already have most of th* 
Ideas that we ar* going to got, ld*as Ilk* millions of 
coord Ins ted mini -theories, procedural embedding of 
knowledge , associative retrieval, and scenario frames. 
Suppose that we have now only to devote th* large ef- 
fort required to build really huge Intelligent systems 
based on these ideas. To my knowledge, no one advo- 
cates this alternative view, but consider this: What- 

ever the nature of an Intelligent system, It will be 
exceedingly complex. Its performance will derive in 
large part from Its complexity. V* will not be sure 
that AI la ready to build a large, intelligent system 
until after w* have don* so. The elegance of th* 
basic ideas and the new and powerful languages alone 
will not be sufficient indication of our maturity. 

At some time, w* will have to put together exceedingly 
complex systems. The time at which It is appropriate 
to try will always be a guess. 

My guess Is that we still have a good deal of work to 
do on th* problem of bow to obtain, represent, coor- 
dinate, and us* the extensive knowledge we now know 
Is required. But these ideas will not com* to those 
who merely think about the problem. They will com* 
to those who both think and experiment with much 
larger systems than w* have built so far. 

Another problem, of a more practical type, concerns 
knowledge acquisition. Today, the knowledge in s pro- 
gram must be put in "by hand" by the programmer al- 
though there are beginning attempts at getting 
programs to acquire knowledge through on-line Inter- 
action with skilled humans. To build really large, 
knowledgeable systems, we will have to "educate" ex- 
isting programs rather than attempt th* almost impos- 
sible feat of giving birth to already competent ones. 
[Some researchers (e.g., Papert, 1972) expect that at 
leaet some of th* principles we discover for educating 
programs will have an impact, perhaps revolutionary, 
on how we educate people-.] 

In this connection, w* have alresdy mentioned that 
several successful AI systems us* a combination of man 
and machine to achieve high performance levels. I 
expect this research strategy to continue and to pro- 
vide the setting In which the human expert (a) can 
gradually transfer skills to th* machine, [woods and 
Uakhoul (1973) consciously apply a strategy such as 
this and call it "Incremental simulation,"] 

I have not yet mentioned In this paper the subject of 
learning. It is because 1 have come to agree with 
John McCarthy that ww cannot havw s program learn a 
fact before w* know how to tell It that fact and be- 
fore th* program knows how to us* that fact. V# have 
been busy with telling and using facts. Learning 
them Is still in the future, although some isolated 
successes have, In fact, occurred. [See especially, 
Samuel (1959, 1967), Winston (1970), Flkes *t al. 
(1972a), and Susaman (1973).] 

Continuing our dlacusalon of th# likely future of AI , 
we note that the increasingly pragmatic attitude of 
those who have been sponsoring AI research will have 
a great effect on the course of this research. There 
may even be s temporary reduction of effort by AI re- 
searchers in the core topics and th# first-level ap- 
plications areas In favor of Increase, support of 
engineers and scientists building second-level appli- 
cations. The results of these second-level efforts 
may, In fact, be rather spectacular. 1 have In mind 
such things ms automated factories, automatic robots 
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for factor!** and warehouses, medical diagnosis sys- 
tems, systems that will automata a largo amount of 
office work, local aids, teaching aids, interactive 
software production systems, and so on. [rirseheln 
et al. (1973) , make some predictions about when these 
and other intelligent systems may come.] 

The short range result of this Increased pragmatism 
may tend to fractionate the field. In the long run, 
though, if there really are many more core ideas to 
be discovered, these technological efforts will stim- 
ulate their discovery, provided that a sufficient 
level of basic Investigation continues. 

In closing, 1 have one final prediction. Aa AZ suc- 
cesses grow, so will the criticisms of AX, especially 
from those who are certain that intelligence cannot 
be mechanized. These critics, having been forced out 
of various mystical trenches in the past, will be 
especially vigorous In their defense of what little 
ground remains to them. The ensuing debates will have 
the crucially Important side effect of getting us all 
to consider how we want to use and control our new 
Intellectual powers. I hope that society assesses 
these powers accurately and is not lulled by certain 
otherwise well-meaning humanists Into believing that 
Artificial Intelligence is not real. 
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Each entry has a code symbol or symbols associated 
with one or more of the twelve subheadings of AI that 
we have discussed in the paper. These symbols are: 

DSD * Common-Sense Reasoning, Deduction, and Problem 
Solving; REP a Modeling and Representation of Knowl- 
edge; SEARCH a Heuristic Search; SYS a AZ Systems and 
languages; GAME a Game Playing; AIDS a Hath, Science, 
and Engineering Aids; TP a Automatic Theorem Proving; 
PROG ■ Automatic Programming; ROB a Robots; YIS a 
Machine Vision; LANG a Natural Language Systems; 

PSYC « Information Processing Psychology. A prefix 
"-C" after a symbol means that the reference contains 
a general discussion or survey. 

The code symbol "GEN” identifies the reference as 
being general to the whole field of AI. These general 
references are: 


Collins and Mlchle (1968) 

Dale and Michie (1968) 

Dreyfus (1963, 1972) 
Feigeribau* (1963, 1969) 
Felgenbaum and Feldman (1963) 
Firschcln, et al. (1973) 

Hunt (1974) 

Jackson (1974) 

Ligh thill (1973) 

Ueltzer and Michie (1969, 
1970, 1971, 1972) 

Michie (1968) 


Minsky (1961, 1963, 
1968) 

Newell (1973) 

Papert (1968) 
Papert (1972) 
Roszak (1972) 

Simon (1969) 

Slagle (197;) 
Solomonoff (1966) 
Turing (1930) 
Weizenbaum (1972) 


REFERENCES 

Agin, G. (1973), "Representation and Description of 
Curved Objects," Pb.D. thesis, Elec. Eng. Dept., 
Stanford Unlv., Stanford, CA, July 1972. Available 
as Stanford Artificial Intelligence Laboratory Memo 
AIM-173, Oct. 1972. (VIS) 

Agin, Gerald J. and Blnford, Thomas 0. (1973), 
"Computer Description of Curved Objects," Adv. 

Papers 3d Inti. Conf. on Artificial Intelligence . 
Stanford Unlv., Stanford, CA, Aug. 1973. (VIS) 
Allen, John and Luekham, David (1970), "An Inter- 
active Theorem-Proving Program," Machine Intelli- 
gence, Vol. 5, 321-336, 1970. (TP) 

Amarel, S. (1968), "On Repre sen tat lone of Problems of 
Reasoning About Aetlons," Machine tntelllfence . 

Vol. 3, D. Michie, ed., 131-170, Edinburgh Unlv. 
Press, Edinburgh, 1968. (REP) 

Amarel, S. (1969), "On the Representation of Problems 
and Goal Directed Procedures for Computers," Comm. 
Am. Soc. Cybernetics , Vol. 1, No. 2, 1969. (SEARCH) 
Ambler, A. P. et al. (1973), "A Versatile Computer- 
Controlled Assembly System," Adv, Papers 3d lntl. 
Conf. on Artificial Intelligence , Stanford Unlv., 
Stanford, CA, Aug. 1973. (ROB, VIS) 

Anderson, J.R. and Bower, G. H. (1973), Human 
Associative. Memory . V. H. Winston and Sons, 
Washington, D.C., 1973. (PSYC) 

Andrews, P. B. (1968), "Resolution with Merging," 

J. ACM . Vol. 13, 367-381, 196S. (TP) 

Balzer, R. M. (1972), "Automatic Programming," 
Institute Technicsl Memo, Unlv. of So. Calif./ 
Information Sciences Institute, Sept. 1972. 

(PR0G-G) 

Banerji, R. B. (1969), Theory of Problem Solving , 
American Elsevier Publishing Company, New York, 

1969. (GAME) 

Banerji, R. B. and Ernst, G. W. (1972), "strategy 
Construction Using Homomorphism* Between Games," 
Artificial Intelligence , Vol. 3, No. 4, 223-249, 
Winter 1972. (CAME) 

Barnett, Jeffrey (1972), "A Vocal Data Management 
System," 1972 Conf. on Speech Communication and 
Processing, Newton, MA, 24-26 April 1972, U. S. Air 
Force Cambridge Research Laboratories, Bedford, MA, 
22 Feb. 1972, 340-343 (AFCRL-72-0120, Special Re- 
ports Number 131). (LANG) 

Barrow, H. and Popplestone, R. (1971), "Relational 
Descriptions in Picture Processing," Machine 
Intelligence , Vol. 6, B* Meltzer and D. Michie, 
•da., 377-396, Edinburgh Unlv. Press, 1971. (VIS) 
Barrow, H. G. and Crawford, G. F. (1972), "The Mark 
l.S Edinburgh Robot Facility," Machine intelli- 
gence, Vol. 7, B. Meltzer and D. Mlchle, eds. , 
463-480, American Elsevier, 1972. (ROB) 

Barrow, H. G. , Ambler, A. P. , and Burstall, R. M, 
(1972), "Some Techniques for Recognizing Struc- 
tures In Pictures," Frontlwrs of Pattern Recogni- 
tion , S. Watanabe, ed. , 1-29, Academic Press, New 
York, 1972. (VIS) 

Bartlett, F. C. (1938), Thinking , Basic Books, New 
York. (PSYC) 

Bartlett, F. C. (1932), Remembering , Cambridge Unlv. 

Press, Cambridge. (PSYC) 

Becker, J. D. (1970), "An Information- Processing 
Model of Interned late- Lev# l Cognition," Memo No. 

119, Stanford Artificial Intelligence Project, 
Computer Science Dept., Stanford Unlv., Stanford, 
CA. Also Report No. 2333, Bolt, Beranek, and 
Newman, Inc*., Cambridge, MA. (PSYC) 


D-1S 



Becker, J. 0. ( 1973) , **A Model for the Encoding of 
Experiential Information, " Computer Model* of 
Thought end Language , Schenk end Colby, eds., W. H. 
Freemen and Co., Sen Francisco, 1973. (PSYC) 
Berlekamp, E. (1963), "Program for Double- Dummy 
Bridge Problems, A Strategy for Mechanical Game 
Playing,** J. API , Vol. 10, Mo. 3, 337-364, July 
1963. (GAME) 

Berliner, Hans (1970), "Experiences Gained In Con- 
structing and Testing a Chess Program,** Proc. IEEE 
Systems Science and Cybernetics Conf., 316*223, 
.Pittsburgh, PA, Oct. 1970. (GAME) 

Berliner, Hans J. (1973), "Some Necessary . Condi t ions 
for a Master Chess Program,** Adv. Papers 3d Inti. 
Conf. on Artificial Intelligence . Stanford Unlv., 

• Stanford, CA, Aug. 1973. (GAME) 

Bernstein, A. et al. (1939), **A Chess Playing Program 
for the IBM 70-4,** Proc, Western Joint Comp. Conf. 
AIEE , 137-139, Mar. 1939. (GAME) 

Blair, F. •¥. , Grlesmer, J. H. , and Jenks, R. D. 
(1970), **An Interactive Facility for Symbolic 
Mathematics,** Proc. Inti. Comp. Symposium , 394-419, 
Bonn, Germany, 1970. (AIDS) 

Bledsoe, V. V. (1971), "Splitting and Reduction 
Heuristics In Automatic Theorem Proving," Artifi- 
cial Intelligence . Vol. 2, No. 1, 55-77, Spring 

1971. (TP) 

Bledsoe, T. V., Boyer, R. S., and Hcnneman, V. H. 
(1972), "Computer Proofs of Limit Theorems," 
Artificial Intelligence , Vol. 3, 27-60, 1972. (TP) 
Bledsoe, w, V. and Bruei, P. (1973), "A Man-Machine 
Theorem Proving System," Adv. Popera 3d Inti. Conf. 
on Artificial Intelligence , Stanford Unlv., 
Stanford, CA, 1973. (TP) 

Bobrow, D. G. (1964s), "Natural Language Input for a 
Computer Problem-Solving System," Doctoral disser- 
tation, MIT, Sept. 1964. Reprinted In Semantic 
Information Processing , U. Minsky, ed. , MIT Press, 
Cambridge, UA, 196S. ( LANG) 

Bobrov, D. G. (1964b), **A Question-Answering System 
for High- School Algebra Word Problems," Proc. AFIPS 
Fall Joint Comp. Conf. , 591-614, 1964. (LANG) ~~ 
Bobrow, D. G. and Fraser, J. B. (1969), "An Augmented 
State Transition Network Analysis Procedure," 

Proc. Inti. Joint Conf, on Artificial Intelligence , 
337-367, Washington, D.C., 1969. (LANG) 

Bobrow, D. G. and Raphael, B. (1973), "New Pro- 
gramming Languages for AX Research," SRI Artifi- 
cial Intelligence Center Tech. Note 82, Stanford 
Research Institute, Menlo Park, CA, Aug. 1973. To 
appear In Computer Surveys . 1974. (SYS-C) 

Bobrow, D. G. and Wegbrelt, B. (1973s), "A Model for 
Control Structures for Artificial Intelligence 
Programming Languages," Adv. Papers 3d Inti. Conf. 
on Artificial Intelligence , Stanford Unlv., 
Stanford, CA, Aug. 1973. (SYS) 

Bobrow, 0. G. and Wegbrelt, B. (1973b), "A Model and 
Stack Implementation of Multiple Environments," 
CAOI , Vol. 16, No. 10, Oct. 1973. (SYS) 

Bolles, R. and Paul, R. (1973), "The Use of Sensory 
Feedback in a Programmable Assembly System," Memo 
CS-396, Comp. Scl. Dept., Stanford Unlv., Artifi- 
cial Intelligence Lab. Memo AIM-220, Oct. 1973. 
(ROB) 

Boyer, Robert S, and Moore, J Strother (1973), 
"Proving Theorems About LISP Functions," Adv. 

Papers 3d Inti. Conf. on Artificial Intelligence , 
Stanford Unlv., Stanford, CA, Aug. 1973. (PROG) 
Brice, C. R. and Fennema, C. L. (1970), "Scene 
Analysis Using Regions," Artificial Intelligence, 
Vol. 1, No. 3, 203-228. (VIS) 


Bruce, Bertram (1972), "A Model for Temporal Refer- 
ences and Its Application in a Question Answering 
Program," Artificial Intelligence. Vol. 3, 1-26, 

1972. (REP) 

Bruner, J. S., Good now, J. J. , add Austin, G. A. 
(1936), A Study of Thinking, Wiley, New York. 

(PSYC) 

Buchanan, B. G. , Sutherland, G. , and Feigenbaum, E. 
(1969), "Heuristic DENDRAL: A Program for Gener- 

ating Explanatory Hypotheses in Organic Chemistry," 
Machine Intelligence , Vol. 4, B. Mel tier and D. 
Mlchle, eds. , 209-234, American Elsevier Publishing 
Company, New York, 1969. (AIDS) 

Buchanan, B. G. and Lederberg, J. (1971), "The 

Heuristic DENDRAL Program for Explaining Empirical 
Data," Proc. IF1P Congress , Vol. 71, Ljubljana, 
Yugoslavia, 1971. Also Stanford Unlv. AIM 141. 
(AIDS) 

Buchanan, J. R. and Luckham, D..C. (1974), "On 

Automating the Construction of Programs," Stanford 
Artificial Intelligence Lab. Memo, forthcoming 
1974. (DED, PROG) 

Bundy, Alan (1973), "Doing Arithmetic with Diagrams," 
Adv. Papers 3d Inti. Conf. on Artificial Intelli- 
gence , Stanford Unlv., Stanford CA, Aug. 1973. (TP) 
Burstall, R. M. , Collins, J. S. , and Popplestone, 

R. J. (1971), Programming In POP2 , 279-282, 
Edinburgh Unlv. Press, Edinburgh, 1971. (SYS) 
Carbonell, J. R. (1971), ’*AI In CAI: An Artificial 

Intelligence Approach to Computer-Ass la ted 
Instruction," IEEE Trans, on Man-Machine Systems , 
Vol. MMS-11, No. 4, 190-202, Dec. 1970. (AIDS) 
Chang, C. L. and Slagle, J. r. (1971), "An Admissible 
and Optimal Algorithm for Searching and/or Graphs," 
Artificial Intelligence , Vol. 2, 117-128, 1971. 
(SEARCH) 

Chang, C. L. and Lee R. C. (1973), Symbolic Logic 
and Mechanical Theorem . Proving , Academic Press, 

1973. (TP-G) 

Cherniak, E. C. (1972), "Toward a Model of Children's 
Story Comprehension," Al TR-266 , MIT, Cambridge, 

UA. (LANG, DED) 

Chase, W. G. (1973), Visual Information Processing , 
Academic Press, 1973. (PSYC-G) 

Chomeky, N. (1956), "Three Models for the Descrip- 
tion of Language," IRE Trans, on Info. Theory , 

Vol. IT-2(3), 113-124, 1956. (PSYC, LANG) 

Chomsky, N. (1957), Syntactic Structures , Mouton, 

The Hague, 1957. (PSYC, LANG) 

Chomsky, N. (1965), Aspects of the Theory of Syntax , 
MIT Press, Cambridge, UA, 1965. (LANG) 

Clowes, M. G. (1971), "On Seeing Things," Artifi- 
cial Intelligence , Vol. 2, Nu. 1, 79-116, 1971. 
(VIS) 

Colby, K. U. and Enea, H. (1967), "Heuristic Methods 
for Computer Understanding of Natural Language in 
Context Restricted On-Line Dialogues," Mathematical 
Blosclences , Vol. 1, 1-25, 1967. (LANG) 

Colby, K, M. , Weber, S, and Hilf, F. D. (1971), 
"Artificial Paranoia," Artificial Intelligence , 

Vol. 2, No. 1, 1-25, Spring 1971. (PSYC) 

Coles, L. S. (1970), "An Experiment in Robot Tool 
Using," Stanford Research Institute Tech. Note 
No. 41, Stanford Research Institute, Menlo Park, 

CA, Oct. 1970. (ROB) 

Coles, L. S. (1972), "Techniques for Information 
Retrieval Using an Inferential Question-Answering 
System with Natural-Language Input," SRI Artifi- 
cial Intelligence Center Tech. Note 74, Stanford 
Research Institute, Menlo Park, CA, Nov. 1972. 
(LANG) 


0-16 



Colllnt, A. M, and Quilllan, U. R, (1972), "Retrieval 
Time from Semantic Memory," J. Verbal Learning and 
Verbal Behavior , Vol. 9, 240-247, 1969. (PSYC) 
Collins, N. and Mich la, 0., ads. (1968), Machlna 
Intelligence . Vol. 1, Avar lean Elsevier Publishing 
Company, Haw York, 1967. (GEN) 

Coray, E. J. (1969), "Computer- Assisted Daslgn of 
Complex Organic Synthaals," Science , 10 Oct. 1969. 
(AIDS) 

Dsla, E. and Mlchle, D. , ads. (1968), Machine 

Intelligence , Vol. 2, American Elsavlar Publishing 
Cosipany, New York, 1968. (GEN) 

Darlington, J. L. (1971), "A Partial Mechanization of 
Second-Order Logic,” Machlna Intalllganca , Vol. 6, 
91-100,. B. Mai tzar and D. Michla, ads., Edinburgh 
Univ. Prass, Edinburgh, (TP) 

Davlaa, D. J. M. (1971), "POPLER: A POP-2 Planner,” 

llano M1P-R-89, School of Artificial Intalllganca, 
Univ. of Edinburgh. (SYS) 

Davis, U. and Putnam, H. (1960), "A Computing Proce- 
dure for Quantification Theory,” J. ACM , Vol. 7, 
201-215, 1960, (TP) 

Darksan, J. , Rullfson, J. F., and Wald In gar, R. J. 
(1972), "The QA4 Language Applied to Robot 
Planning,” AF1PS Conf. Proc. , Vol. 41, Part 11, 
1181-1187, Fall Joint Comp. Conf., 1972. (DED) 
Dautsch, L. P. (1973), "An Interactive Program Veri- 
fier, " Ph.D. thesis, Dept, of Computer Science, 

Univ. of California, Berkeley, 1973. (PROG) 

Doran, J. and Ulchle, D. (1966), "Experiments with 
the Graph Traverser Program," Proc. Roy. Soc. A , 

Vol. 294, 235-259, 1966. (SEARCH) 

Dreyfus, H. (1965), "Alchemy and Artificial Intelli- 
gence,” RAND Corporation Paper P3244 (AD 625 719), 
Dec. 1965. (GEN) 

Dreyfus, H. L. (1972), What Computers Can* t Do , 

Harper and Roe, 1972. (GEN) 

Duda, R. and Hart, P. (1970), "Experiments in Scene 
Analysis,” Proc. 1st Natl. Symposium on Industrial 
Robots , 11T Research Institute, Chicago, IL, 

Apr. 1970. (VIS) 

Duda, R. and Hart, P, (1973), Pattern Clasalf lcatlon 
and Seen* Analysis , John Wiley It Sons, New York, 
1973. (VIS-G) 

Eastman, C. M. (1971s), ’*GSP: A System for Computer 

Assisted Spsce Planning," Proc. 8th Annual Design 
Automation Workshop , Atlantic City, NJ. (AIDS) 
Eastman, C. M. (1971b), "Heuristic Algorithms for 
Automated Spsce Planning," Proc. 2d Inti, Joint 
Conf. on Artificial Intelligence , Imperial College, 
London, 1971. (AIDS) 

Edmundion, H. P. , ed. (1961), Proc, of the Natl. 
Symposium on Machine Translation , Prentice-Hall, 
Englewood Cliffs, NJ, 1961. (LANG) 

Edwards, D. snd Hart, T. (1961), "The Alpha-Beta 
Heuristic," MIT Artificial Intelligence Memo No. 

30 (revised), 28 Oct. 1963. Originally printed as 
"The Tree Prunw (TP) Algorithm,” 4 Dec. 1961. 

(CAME) 

Ejiri, U. et si. (1971), "An Intelligent Robot with 
Cognition and Decision-Making Ability," Proc. of 2d 
Inti. Joint Conf. on Artificial Intelligence , 
Imperial College, London, 350-358, 1971. (ROB, VIS) 
Ejiri, E. et al. (1972), "A Prototype Intelligent 
Robot that Assembles Objects from Plan Drawings,” 
IEEE Trans. Comp. , 161-170, Feb. 1972. (ROB, VIS) 
Elcock, E. W. et si. (1971), "ABSET, A Programming 
Language Based on Sets: Motivation and Examples,” 

Machine Intelligence , Vol. 6, Edinburgh Univ. Press, 
Edinburah, 1971. (SYS) 


Elspas, B, (1972), "The Semiautomatic Generation of 
Inductive Assertions for Program Correctness 
Proofs," Report No. 55, Seminar, Dea Instltuts fur 
Theorie der Automatan und Schaltnetzwerke, 
Gesellschaft fur Mathematlk und Datenverarbei tung, 
Bonn, 21 Aug. 1972. (PROG) 

Eiapas, B. et al. (1973), "Design of an Interactive 
System for Verification of Computer Programs," 

SRI Report, Project 1891, Stanford Research 
Institute, Menlo Park, CA, July 1973, (PROG) 

Enea, H. and Colby, K.. M. (1973), "idlolectric 

Language-Analysis for Understanding Doctor-Patient 
Dialogue*," Adv. Papers 3d Inti. Conf. on Artificial 
Intelligence , 278-284, Stanford Univ., Stanford, CA, 
1973. (LANG) 

Engelman, C. (1969), "MATH LAB 68,” Information 
Processing 68, A. J. H. Morrell, ed. , 462-467, 

North- Holland Publishing Company, Amsterdam, 1969. 
(AIDS) 

Ernst, C. W. and Newell, Allen (1969), GPS: A Case 

Study In Generality and Problem Solving . Academic 
Press, New York, 1969. (DED) 

Ernst, G. W. (1971), "The Utility of Independent 
Subgoals in Theorem Proving," Information and 
Control . Apr. 1971. (TP) 

Ernst, H. (1961), "MH-1, A Computer-Operated Mechani- 
cal Hand,” D, Sc. dissertation, Dept, of Elec. Eng., 
MIT, Cambridge, MA. (ROB) 

Fahlman, S. (1973), ”A Planning System for Robot 
Construction Tasks," Report Al TR- 283, Artificial 
Intelligence Lab., MIT, Cambridge, UA, May 1973. 
(DED) 

Falk, G. (1970), "Computer Interpretation of Im- 
perfect Line Data as a Three-Dimensional Scene," 
Ph.D. thesis, Stanford Univ., Comp. Sc l. Dept., 

1970. Available as CS-180 and AIM-132. (VIS) 

Falk, G. (1972), "interpretation of Imperfect Line 
Data as a Three Dimensional Scene," Artificial 
Intelligence , Vol. 3, No. 2, 101-144, 1972. (VIS) 
Feigenbaum, E. A. (1961), "The Simulation of Verbal 
Learning Behavior," Proc. Western Joint Comp. Conf. , 
121-132, 1961. Also in Computers and Thought , 

E. A. Feigenbaum and J. Feldman, eds. , 297-309, 
McGraw-Hill Book Company, New York, 1963. (PSYC) 
Feigenbaum, E. (1963), "Artificial Intelligence 
Research," IEEE Trans. Info. Theory , Vol. IT-9. 

No. 4, 248-261, Oct. 1963. (GEN) 

Feigenbaum, E. (1969), "Artlfical Intelligence: 

Themes in the Second Decade," Information Processing 
68, Vol. 2,A.J.H. Morrell, ed., 1008-1022, North- 
Holland Publishing Company, Amsterdam, 1969, Also 
printed as Stanford Univ. Artificial Intelligence 
Project Memo No. 67, IS Aug. 1968. (GEN) 

Feigenbaum, E. and Feldman, J. , eds. (1963), Computer* 
snd Thought , McGraw-Hill Book Company, New York, 
1963. (GEN) 

Feldman, J. A. et al. (1969), "The Stanford Hand-Eyw 
Project," Proc. 1st Inti., Joint Conf, on Artificial 
Intell Igwncw , 521-326, Washington, D.C., 1969. 

(ROB) 

Feldman, J. A. et al. (1971), "The Use of Vision and 
Manipulation to Solve the Instant Insanity Puzzle," 
Proc. 2d Inti. Joint Conf. on Artificial Intelli- 
gence , London, 1971. (ROB, VIS) 

Feldman, J. A. et al. (1972), "Recent Developments in 
SAIL— An ALGOL-Based Language for Artificial 
Intelligence," 1972 FJCC Proc. , 5-7 Dec. 1972, 
Anaheim, CA. (SYS) 

Feldman, J. A*, and Rovner, P. D. (1969), "An ALGOL- 
Baaed Associative Language," Comm. ACM, 434-449, 

Aug. 1969. (SYS) 




D-17 


Fikes, R. E. (1968) , "A Heuristic Program for Solving 
Problem* Stated aa Nondetermlnistlc Procedure*/* 
Ph.D. the*la # Cam* gle-Me lion Univ. , 196S. (DED, 
SYS) 

Fikes, R. £. (1970), "REF-ART: A System for Solving 

Problems Stated aa Procedures,** Artificial Intel- 
ligence , Vol. 1(1), 1970. (DED, SYS) 

Fikes, R. E. and Nilsson, N. J. (1971), "STRIPS: A 

He* Approach to the Application of Theorem Proving 
in Problem Solving," Artificial Intelligence , Yol. 
2, 189-208, 1971. (REP, DEO) 

Flkea, R. E., Hart, P* E., and Nilsson, N, J. 

(1972a), "Learning and Executing Generalized Robot 
Plana," Artificial Intelligence , Vol. 3, 231-288, 
-1972. (DED) 

Flkea, R. E. , Hart, P. £., and Nilsson, N. J. 

(1972b), "Some Nee Directions in Robot Problem 
Solving," B. Meltzer and D. Mlchle, eda., Machine 
Intelligence , Vol. 7, Edinburgh Univ. Press, 
Edinburgh, 1972. (ROB-G) 
rirscheln, Oscar et al. (1973), "Forecasting and 
Assessing the Impact of Artificial Intelligence on 
Society," Adv. Paper* 3d Inti. Conf. on Artificial 
Intelligence , Stanford Univ., Stanford, CA, Aug. 
1973. (GEN) 

Fischler, Martin A. and Elschlager, Robert A. (1973), 
"The Representation and Matching of Pictorial 
Structures," IEEE Trans, on Computers , Yol. C-22, 
No. 1, 67-92, Jan. 1973. (VIS) 

Flanagan, J. L. (1965), Sp eech Analysts. Synthesis 
and Perception. Academic Press, New York. (LANG) 
Floyd, R. W. (1967), "Assigning Meanings to Pro- 
grams," Proc, of a Symposium in Applied Mathe- 
matics , Vol. 19, J. T. Schwartz, ed., Am. Math. 
Soc., 19-32, 1967. (PROG) 

Forsen, 0. (1968), "Proceaalng Visual Data with an 
Automaton Eye," Pictorial Pattern Recognition , 
471-502, Thompson Book Co., Washington, D.C., 1968. 
(VIS) 

Friedman, Joyce (1971), A Computer Model of Trans- 
formations! Grammar , 166, American Elsevier Pub- 
lishing Company, New York, 1971. (LANG) 

Friedman, L. (1967), "instinctive Behavior and Its 
Computer Synthesis," Behavioral Science , Vol. 12, 
No. 2, Mar. 1967. (PSYC) 

Fuller, S., Gaschnig, J, , and Glllogly, J. (1973), 
"Analysis of the Alpha-Beta Pruning Algorithm," 
Camegle-Mellon Univ,, Dept, of Comp. Scl. Report, 
Pittsburgh, PA, July 1973. (GAME) 

Gelsrnttr, H. (1960), "Realization of a Geometry 
Theorem-Proving Machine," Proc. 1959 Inti. Conf. 
on Info. Proc ., 273-282, UNESCO, Paris, Also in 
Computers and Thought , E, A. Felgenbaum and J. 
Feldman, eds., 134-152, McGraw-Hill Book Company, 
New York, 1963. (TP, SEARCH) 

Gelernter, H. , Hansen, J. R. , and Loveland, D. W. 
(1963), "Empirical Explorations of the Geometry 
Theorem Machine," Computers and Thought , E. A. 
Feigenbaum and J. Feldman, eds,, 153-163, McGraw- 
Hill Book Company, New York, 1963. (TP) 

Gere, W. S. (1966), "Heuristics in Job Shop 

Scheduling," Management Science , Vol. 13, 167-190. 
(AIDS) 

Glllogly, J. (1972), "The Technology Chess Program," 
Artificial Intelligence , Vol. 3, No, 3, 145-163, 
rail 1972. (GAME) 

Goldstein, A. Jay, Harmon, Leon D. , and Lesk, Ann B. 
(1971), "Identification of Human Faces," Proc. 

IEEE , Vol. 39, No. 5, 748-760, May 1971. (VIS) 
Good, I. J. (1967), "A Five-Year Plan for Automatic 
Chess," Machine Intelligence , Vol. 2, E. Dale and 
D. Mlchle, eds., 89-118, Edinburgh Univ. Press, 
Edinburgh, 1967. (GAME) 


Good, D. I. and London, R. L. (1968), "interval 
Arithmetic for the Burroughs B3500: Four ALGOL 

Procedures and Proofs of Their Correctness," Comp. 
Scl. Tech. Report No. 28, Univ. of Wisconsin, 1968. 
(PROG) 

Gordon, G. (1969), System Simulation . Prentice-Hall, 
Englewood Cliffs, NJ, 1969. (SYS) 

Grape, G. (1973), "Model Based (Intermediate Level) 
Computer Viston," Ph.D. thesis, Comp. Scl. Dept., 
Stanford Univ., Stanford, CA, 1973. (VIS) 
Greeiiblatt, R. et si. (1967), "The Greenblatt Cheis 
Program," Proc, AFIPS Fall Joint Comp, Conf, , 801- 
810, 1967. (GAME) 

Green, B. F. et si. (1961), "Baseball: An Automatic 

Question Answerer," Proc. Western Joint Comp. Conf. , 
219-224. TLANG) 

Green, C. (1969a), "Theorem-Proving by Resolution as 
s Basis for Quest ion- Answer lag Systems," Machine 
Intelligence . Vol. 4, B. Ueltzer and D. Mlchle, 
eda., 183-205, American Elsevier Publishing Company, 
New York, 1969. (REP, DED) 

Green, C. (1969b), "The Application of Theorem- Proving 
to Quest Ion- Answering Systems," Doctoral disserta- 
tion, Elec. Eng. Dept., Stanford Univ., Stanford, 

CA, June 1969. Also printed as Stanford Artificial 
Intelligence Project Memo AI-96, June 1969. (REP, 
DED, PROG, TP) 

Green, a C. (1969c), "Application of Theorem- Proving 
to Problem Solving," Proc. Inti. Joint Conf, 
Artificial Intelligence . Donald E. Walker and 
Lcwia M. Norton, eda., Washington, D.C. , May 1969. 
(REP, DED) 

Gregory, R. L. (1966), Eye and Brain , McGraw-Hill 
Book Company, New York. (PSYC) 

Gregory, R. L. (1970), The Intelligent Eye , Weldenfeld 
and Nlcolaon, London, 1970. (PSYC) 

Grlesmcr, J. H. and Jenki, R. D. (1971), "SCRATCHPAD/ 

1 — An Interactive Facility for Symbolic Mathe- 
matics," Proc. ACM 2d Symposium on Symbolic and 
Algebraic Manipulation , S. R. Patrick, ed. , Los 
Angeles, CA, 23-25 Mar. 1971. (AIDS) 

Griffith^ A, K. (1970), "Computer Recognition of 

Prismatic Solids," MAC Tech. Report 73, Project MAC, 
MIT, Cambridge, UA. (VIS) 

Griffith, A. K. (1973), "Mathematical Models for 
Automatic Line Detection,” J. ACM , 62-80, 1973. 

(VIS) 

Gross, Louis N. and Walker, Donald E. (1969), **0n- 
Line Computer Aids for Research in Linguistics,'* 
Information Processing , Vol. 68, A, J. H. Morrell, 
ed., North-Holland Publishing Company, Amsterdam, 
1969. (LANG) 

Guard, J. R. et al. (1969), "Seml-Au tons ted Mathe- 
matics," J. ACM , Vol. 16, 49-62, 1969, (TP) 

Guzman, A. (1968a), "Decomposition of a Visual Scene 
Into Three- Dimensional Bodlea," Proc. AFIPS 1968 
rail Joint Comp. Conf. , Vol. 33, 291-304, Thompson 
Book Co., Washington, D.C. (VIS) 

Guzman, A. (1968b), "Computer Recognition of Three- 
Dimensional Objects in g Visual Scene," MAC Tech. 
Report 59, thesis, Project MAC, HIT, Cambridge, MA, 
1968. (VIS) 

Guzman, A. (1971), "Analysis of Curved Line Drawings 
Using Context and Global Information,” Machine 
Intelligence , Vol. 6, B. Meltzer and 0. Mlchle, 
eds,, 325-376, Edinburgh Univ. Press, Edinburgh. 
(VIS) 

Haessler, R. W. (1971), "A Heuristic Programming 
Solution to* a Nonlinear Cutting Stock Problem," 
Management Science , Vol. 178, 793-802. (AIDS) 
Harris, Z. (1931), Structural Linguistics , Univ. of 
Chicago Press, Chicago, 1951. (LANG) 



Harris, Z. (1961), String Analysis of Sentence 
Structure, Uouton, The Hague, 1961. (LANG) 

Hart, P. E, et si. (1972), "Artificial Intelligence- 
Research and Applications," Annual Tech. Report to 
ARPA, Contract DAHC04-72-C-OOOS, Stanford Research 
Institute, Menlo Park, CA, Dec. 1972. (ROB) 

Hart, P. ( Nilsson, N., and Raphael, 9. (1968), "A 
Fore a l Basis for the Heuristic Determination of 
Minimum Cost Paths," IEEE Trans. Sys. Set. Cyber- 
netics , Vol. 4, No. 2, 100-107, 1968. (SEARCH) 

Hart, T. (1981), "SIMPLIFY," Memo 27, Artificial 
Intelligence Group, Project MAC, HIT, Cambridge, 

VA, 1961.. (AIDS) 

Hayes, P. (1971), "A Logic of Action," Machine 
Intel ligcnce , Vol. 6, B. Meltzer and D. Ulchie, 
eds. , 495-520, American Elsevier Publishing 
Company, 1971. (REP) 

Hayes, P. F. (1973), "The Frame Problem and Related 
Problems in Artificial Intelligence," Artificial 
and Human Thinking , A. Ellthorn and 0. Jonea, eda. , 
Elsevier Scientific Publishing Co., New York, 

1973, (REP) 

Hearn, A. C. (1968), "REDUCE , A User-Oriented Inter- 
active System for Algebraic Simplification," 
Interactive Systems for Experimental Applied 
Mathematics , 79-90, M. Klerer and J. Relnfelds, 
eds,, Academic Press, New York and London, 1968. 
(AIDS) 

Hearn, Anthony C. (1971), "REDUCE 2: A System and 

Language for Algebraic Manipulation," Proc. ACM 
2d Symposium on Symbolic and Algebraic Manipula- 
tion , S. R. Petrick, ed. , 23-25 Mar. 1971, Los 
Angeles, CA. (AIDS) 

Hendrix, G. (1973), "Modeling Simultaneous Actions 
and Continuous Processes," Artificial Intelligence , 
Vol. 4, 145-180, 1973. (REP) 

Hendrix, Gary G, , Thompson, Craig w. , and Slocum, 
Jonathan (1973), "Language Processing via Canonical 
Verbs and Semantic Models," Adv, Papers 3d Inti. 
Conf. on Artificial Intelligence , Stanford Univ., 
Stanford, CA, Aug. 1973. (LANG) 

Hewitt, C. (1969), "PLANNER: A Language for Proving 

Theorems in Robot*," 1st Inti. Joint Conf. on 
Artificial Intelligence , Washington, D.C. , 1969. 

(REP, SYS, DED) 

Hewitt, C. (1971), "Procedural Embedding of Knowledge 
in PLANNER," Proc. 2d Inti. Joint Conf. on Artifi- 
cial Intelligence , British Computer Society, London, 
England, 167-182, 1971. (REP, SYS, DED) 

Hewitt, C. (1972), "Description and Theoretical 
Analysis (Using Schemata) of PLANNER: A Language 

for Proving Theorems and Manipulating Models in a 
Robot," Ph.D. thesis, Dept, of Math., MIT, 
•Cambridge, UA, 1972. (SYS, REP, DED) 

Hewitt, C. , Bishop, P. , and Steiger, R. (1973), H A 
Universal Modular Actor Formalism for Artificial 
Intelligence," Adv. Papers 3d Inti. Conf. on Ar- 
tificial Intelligence . Stanford Univ., Stanford, 

CA, Aug. 1973. (SYS) 

Hintzman, D, L. (1968), "Explorations with a Dis- 
crimination Net Model for Paired-Associate 
Learning," J. Mathematical Psychology , Vol. S, 
123-162, 1968. (PSYC) ~ 

Horn, B. K. P. (1970), "Shape from Shading: A Method 

for Obtaining the Shape of a Smooth Opaque Object 
from One View," MAC Tech. Report 79, Project MAC, 
MIT, Cambridge, MA, Nov. 1970. (VIS) 

Horn, B. K. P. (1971), "The Binford-Horn Line Finder," 
Vision Flaah , No. 16, Artificial Intelligence 
Laboratory, MIT, Cambridge, MA. Later issued as 
MIT Artificial Intelligence Lab Memo 285, Mar. 

1973. (VIS) 


Huct, G. P. (1973a), "a Unification Algorithm for 
Type Theory," IRIA Laboria. 1973. (TP) 

Huet, G. P. (1973b), "A Mechanization of Type Theory," 
Adv. Papers 3d Inti. Conf. on Artificial Intelli- 
gence , Stanford Univ*, Stanford, CA, Aug. 1973. 

(TP) 

Huffman, D. A. (1971), "impossible Objects as Nonsense 
Sentences," Machine Intelligence. Vol. 6, B. Meltzer 
and 0. Ulchie, eds. 295-323, Edinburgh Univ. Press, 
Edinburgh, 1971, (VIS) 

Hunt, E. B. (1962), Concept Formation . John Wiley k 
Sons, New York. (PSYC) 

Hunt, E. B. (1974), Artificial Intelligence , Academic 
Press, 1974. (GEN) 

Hunt, E. B. and Hovland, C. 1. (1961), "Programming 
a Model of Human Concept Formation," Proc. Westwrn 
Joint Comp. Conf. . Vol. 19, 145-155. (PSVC) 

Igarashi, London, R. , and Luckhaa, D. (1973), 
"Automatic Verification of Programs I: A Logical 

Basis and Implementation," Stanford Univ. Artificial 
Intelligence Lab. Memo, No. 200, Stanford Univ., 
Stanford, CA, May 1973. (PROG) 

Jackson, P. C. (1974), Introduction to Artificial 
Intelligence , Mason and Lipacomb, New York, 1974. 
(GEN) 

Kaplan, R. U. (1972), "Augmented Transition Networks 
as Psychological Models of Sentence Comprehension," 
Artificial Intelligence , Vol. 3, 77-100, 1972, 

(PSYC) 

Kaplon, R. U. (1973), "A General Syntactic Processor," 
Natural Language Processing , R. Rustln, ed. , 293- 
241, Algorithmic Press, New York, 1973. (LANG) 

Katz, S. U. and Manna, Z. (1973), "Heuristic Approach 
to Program Verification," Adv. Papers 3d Inti. 

Conf. on Artificial Intelligence , 500-512, Stanford 
Univ., Stanford, CA, Aug. 1973, (PROC) 

Kay, Martin (1964), "A General Procedure for Rewriting 
Strings," presented at the 1964 Annual Meeting, 
Association for Machine Translation and Computa- 
tional Linguistics, Indiana Univ., Bloomington. 
(LANG) 

Kay, Martin (1967), "Experiments with a Powerful 

Parser," RM-5452-PR, RAND Corporation, Santa Monica, 
CA, 1967. (LANG) 

Kay, Martin (1973), "Tha Mind System," Natural 
Language Processing , R. Rustln, ed. , 155-187, 

Courant Computer Science Symposium 8, 20-21 Dec. , 
Algorithmic* Press, Inc., New York, 1973, (IANG) 
Kelly, U. (1970), "Visual Identification of People 
by Computer," Memo AI-130, Comp. Scl. Dept., Stan- 
ford Univ., Stanford, CA, July 1970. (VIS) 

King, J. (1969), **A Program Verifier," Doctoral 
dissertation, Comp. Scl. Dept., Came gie-Me lion 
Univ., Pittsburgh, PA, 1969. (PROG) 

Klster, J. et al. (1957), "Experiments in Chess," 

J. ACM , Vol. 4, 174-177, Apr. 1937. (GAME) 

Kling, R. E. (1971), "A Paradigm for Reasoning by 
Analogy," Artificial Intelligence , Vol. 2, No. 2, 
147-178, Fall 1971. (DED) 

Koffman, Elliot B. and Blount, Sumner E. (1973), 
"Artificial Intelligence and Automatic Programming 
in CAI , " Adv. Papers 3d Inti. Conf. on Artificial 
Intelligence , Stanford Univ., Stanford, CA, Aug. 
1973. (AIDS) 

Korsvold, K. (1965), "An On-Line Algebraic Simplifica- 
tion Program," Artificial Intelligence Project Memo 
No. 37, Stanford Univ., Stanford, CA, Nov. 1965. 
(AIDS) 

Kotok, A. (1962), "A Cheaa Playing Program for the 
IBM 7090," Bachelor 1 * thesis, MIT, 1962. (GAME) 


r 


D-19 



Kowalski, R. (1970), "Search Strategies for Theorem 
Proving,’* Mach 1 no Intelligence . Vol. 5, B. Moltzor 
and D. Michle, eds. , 181-200, American Elsevier 
Publishing Company, Mow York, 1970. (TP) 

Kowalski, R. and Kuohnor, D. (1971), **Llnaar 
Rosolutloa with Soloctlon Function,** Artificial 
Intelligence, Vol. 2, 227-260, 1971. (TP) 
Kullkowskl, C. A. and Volaa, 8. (1972), **Tho Modlcal 
Consultant Program-Glaucoma," Toch. Report TR-3, 
Computers in Blow dlclne , Dopt. of Comp. Set., 
Rutgera Only., Mow Brunswick, MJ, July 1972. 

(AIDS) 

Cuno, Busumu and Oottlngor, Anthony G. (1963), 
"Multiple-Path Syntactic Analyzer, ** Information 
Procoaslng 1962 , 306-312, Korth-Holland Publishing 
Company, Amsterdam, 1963. (LAMG) 

Levy, D. M. L. (1970), "Computer Chess— A Case 
Study," Machine Intelligence. Vol. 6, B. Meltzer 
and D. Michle, eds., 131-163, Edinburgh Unlv. 

Press, 1970. (GAME) 

Lighthlll, J. (1973), "Artificial Intelligence: 

A General Survey," Artificial Intelligence: A 

Paper Symposium, Science Research Council Pamphlet, 
Science Research Council, State House, High 
Holburn, London, Apr. 1973. (GEN) 

Lin, Shen (1970), "Heuristic Techniques for Solving 
Large Combinatorial Problems on a Computer," 
Theoretical Approaches to Mon-Numerical Problem- 
Solving . R. Banerji and M. Uesarovlc, eds., 410- 
418, Sprlnger-Verlag, New York, 1970. (AIDS) 
Lindsay, P. H. and Norman, D. A. (1972), Human 
Information Processing: An Introduction to 

Psychology , Academic Press, 1972. (PSYC-G) 

Locke, William N. and Booth, A. Donald, eds. (1933), 
Mschlne Translation of Languages: Fourteen Essays , 

John Wiley L Sons, New York, 1955. (LANG) 

Londe, Dave L. and Schoene, William J. (1968), "TGT: 
Transformational Grammar Tester," Proc, AFIPS 
Conf . . Vol. 32, 1968 Spring Joint Comp. Conf . , 
385-393, Thompson Book Co., Washington, D.C. 

(LANG) 

London, R. (1970), "Bibliography on Proving the 

Correctness of Computer Programs," Machine Intelli- 
gence . Vol. 5, B. Meltzer and D. Michle, eds., 369- 
380, American Elsevier Publishing Company, New 
York, 1970. (PROG-G) 

London, R. L. (1972), "The Current State of Proving 
Programs Correct," Proc. ACM 23th Annual Conf. , 
39-46, 1972. (PROG-G) 

Loveland, D. W. (1970), "A Linear Format for Resolu- 
tion," Symposium on Automatic Demonstration , M. 
Laudet, ed. , Sprlnger-Verlag, New York, 1970. (TP) 
Luckham, D. (1969), "Refinement Theorems In Resolu- 
tion Theory," Stanford Artificial Intelligence 
Project Memo Al-81, 24 Mar. 1969. Also In Proc. 
IRIA 1968 Symp. Autom. Demonstration, lecture 
Notes on Mathematics No. 123 , Sprlnger-Verlag, 

New York, 1970. (TP) 

Luckham, D. and Nilsson, N. (1971), "Extracting 
information from Resolution Proof Trees," Artifi- 
cial Intelligence , 1971. (TP) 

Manna, Z. (1969), "The Correctness of Programs," 

J. Computer Syst. Scl. t Vol. 3, May 1969. (PROG) 
Manna, Z. and Waldlnger, R. J. (1971), "Toward 

Automatic Program Synthesis," Comm, ACM , Vol. 14, 
No. 3, 131-165,* Mar. 1971. (PROG-C) 

Martin, W. A. (1967), "Symbolic Mathematical 

Laboratory," MAC Tech. Report 36 (thesis), Project 
MAC, MIT, Cambridge, MA, Jan. 1967. (AIDS) 

Martin, W. A. and Fatesan, R. J. (1971), "The MACSYHA 
System," Proc. AOI 2d Symposium on Symbolic and 
Algebraic Manipulation , S. R. Patrick, ed. , Los 
Angeles, CA, 23-23 Mar. 1971. (AIDS) 


McCarthy, J. (1938), "Programs with Common Sense," 
in "Mechanization of Thought Processes," Vol. 1, 
77-84, Proc. Symp., Nat. Phya. Lab.. London, 24-27, 
Nov. 1938. Reprinted in Semantic Information 
Processing , II. Ulnsky, ed. , 403-410, MIT Press, 
Cambridge, UA, 1968. (REP, OED) 

McCarthy, J. (1960), "Recursive Functions of Symbolic 
Expressions and Their Computation by Machine, Part 
1," Co—. ACM. Vol. 3, Mo. 4, 184-195, Apr. 1960. 
(SYS) 

McCarthy, John (1961), "Computer Programs for Checking 
Mathematical Proofs," Proc. Amer. Math. Soc. on 
Recursive Function Theory, Mew York. Apr. 1961. 

(TP) 

McCarthy, J. (1962), "Towards a Mathematical Science 
of Computation," Proc. IFIP Congr. . Vol. 62, North- 
Holland Publlahlng Company, Amsterdam, 1962. (PROG) 
McCarthy, J. (1963), "Situations, Actions and Causal 
Lavs," Memo. Mo. 2, Stanford Univ. Artificial 
Intelligence Project, 1963. Reprinted in Semantic 
Information Processing . M. Minsky, ed., 410-418, 

MIT Press, Cambridge, MA, 1968. (REP) 

McCarthy, J. and Painter, J. A. (1967), "Correctness 
. of a Compiler for Arithmetic Expressions," in Proc. 
Symp. Applied Mathematics , Vol. 19, Math. Aspects 
of Computer Science . J. T. Schwarcz, ed., 33-41, 
American Mathematical Society, Providence, RI, 

1967. * (PROG) 

McCarthy, J. et al. (1968), "A Computer with Hands, 
Eyes, and Ears," Proc. 1968 Fall Joint Comp. Conf. , 
Vol. 33, 329-338, Thompson Book Company, Washington, 
D.C. (ROB) 

McCarthy, J. and Hayes, P. (1969), "Some Philosophi- 
cal Problems from the Standpoint of Artificial 
Intelligence," Mschlne Intelligence , Vol. 4‘, American 
Elsevier Publishing Company, New York, 1969. (REP) 
McDermott, D. V. and Sussman, G. J. (1972), "The 
CDNNIVER Reference Manual," HIT, Artificial Intel- 
ligence Lab., Memo No. 239, May 1972. (SYS) 

Meltzer, B. and Michle, D. , eds. (1969), Machine 
Intelligence , Vol. 4, American Elsevier Publishing 
Company, New York, 1969. (GEN) 

Meltzer, B. and Michle, D. , eds. (1970), Machine 
Intelligence, Vol. 5, American Elsevier Publishing 
Company, New York, 1970. (GEN) 

Meltzer, B. and Michle, D. , eds. (1971), Machine 
Intelli gence , Vol. 6, American Elsevier Publishing 
Company, New York, 1971. (GDI) 

Meltzer, B. and Michle, D. , eds. (1972), Machine 
Intelligence , Vol. 7, American Elsevier Publishing 
Company, New York, 1972. (GEN) 

Michle, D. , ed. (1968), Machine Intelligence , Vol. 3, 
American Elsevier Publishing Company, Princeton, 

MJ, 1968. (GEN) 

Miller, G. A. (1936), "The Magical Number Seven, 

Plua or Minus Two," Psychological Review , Vol. 63, 
81-97. (PSYC) 

Millar, G. A., Galanter, E., and Pribram, K. H. 

(I960), Plana and the Structure of Behavior , Holt, 
Rinehart A Winston, New York. (PSYC) 

M inker, J., Fishman, D. H., and McSkimin, J. R. 

(1972), "The Maryland Refutation Proof Procedure 
System (MRPPS)," TR-208, Comp. Scl, Center, Unlv. 
of Maryland, College Park, MD, 1972. (TP) 

Minsky, M. (1961), "Steps Toward Artificial Intelli- 
gence," Proc. IRE , Vol. 49, 8-30, Jan. 1961, 

(SEARCH, GEN) 

Minsky, M. L. (1963), "Matter, Mind, and Models," 

IFIP , 1965.’ (GEN) 

Minsky, M. , ed. (1968), Semantic Information 

Processing , MIT Press, Cambridge, UA. (GEN, LANG-G) 
Minsky, M. (1974), "Frame-Systems : A Framework for 

Representation of Knowledge," forthcoming 1974. (REP) 


D-20 





Minsky , M. and Papert, S. (1973), Progress Report, 

M Memo 252, .Artificial Intelligence Laboratory, 
Cambridge, MA. (REP, VIS-C) 

Mittaan, B. (1973), "Can a Computer Baat Bobby 
Fischer?," Datamation , 84-87, Juna 1973. (CAME) 

Uoora, J. and Newell, A. (1973), "How Can Marlin 
Undarstand?," Dapt. of Con. Scl. Report, Caraegle- 
Mallon Unlv. , Nov. 15, 1973. (REP) 

Moses, J. (1967), "Symbolic Integration," MAC Tech. 
Raport 47, Project MAC, MIT, Cambridge, MA, Dec. 

1967. (AIDS) 

Moses, Joel (1971a), "Symbolic Integration: The 

Stormy Decade," proc. ACM 2d Symposium on Symbolic 
and Algebraic Manipulation, S. R. Patrick, ed. , 

Los Angeles, CA, 23-25 Mar. 1971. (AIDS-C) 

Mosas, Joel (1971b),' "Algebraic Simplification: A 

Guide for the Perplexed," Proc. ACM 2d Symposium 
on Symbolic and Algebraic Manipulation , S. R. 

Petrick, ed. , Los Angeles, CA, 23-25 Mar. 1971. 
(AIDS-G) 

Nelsser, U. (1967), Cognitive Psychology , Appleton- 
Century-Crof ts, New York. (PSYC) 

Nevatla, Ramakant and Blnford, Thomas 0. (1973), 
"Structured Descriptions of Complex Objects," 

Adv. Papars 3d Inti. Conf. on Artificial Intelli- 
gence , 641-645, Stanford Unlv., Stanford, CA, 

Aug. 1973. (VIS) 

Nevins, A. J. (1972), "A Human Oriented Logic for 
Automatic Theorem Proving," Tech, Report 268, MIT 
Artificial Intelligence Laboratory, UIT, Cambridge, 
MA, Oct. 1972. (TP) 

Nevins, J. L. , Whitney, D. E., and Simunovic, S. N. 
(1973), "Report on Advanced Automation," No. R-764, 
prepared for National Science Foundation, Grant No. 
GK-34094 and GI-39432X, The Charles Stark Draper 
Lab., Inc., Cambridge, MA, Nov, 1973. (ROB) 

Newell, A., ed. (1961), Information Processing 

Language V Manual , Prentice-Hall, Englewood Cliffs, 
NJ. (SYS) 

Newell, A. (1967), Studies In Problem Solving: Sub- 
ject 3 on the Cryptarithmetlc Task: Donald + 

Gerald » Robert , Camegie-Mellon Unlv, , Pittsburgh, 
PA. (PSYC, REP) 

Newell, A. (1970), "Remarks on the Relationship Be- 
tween Artificial Intelligence and Cognitive Psy- 
chology," Theoretical Approaches to Non-Nuwerlcal 
Problem Solving , R. B. Banerji end M. D. Mestrovic, 
eds., Sprtnger-Verlag, Berlin. (PSYC-G) 

Newell, A. (1972a), "Production Systems: Models of 

Control Structures," Visual Information Processing , 
Wm. Chsse, ed. , Academic Press, New York, 1972. 

(REP, PSYC) 

Newell, A. (1972b), "A Theoretical Exploration of 
Mechanisms for Coding the Stimulus," Coding 
Processes In Human Memory , A. Melter and E. Martin, 
eds., V. H. Winston, Washington, D.C. , 1972. '(REp, 
PSYC) 

Newell, A. (1973), "Artificial Intelligence and the 
Concept of Mind," Computer Models of Thought snd 
Language , Roger Schank and Kenneth Colby, eds, , 

W, H. Freeman L Co., San Francisco, 1973. (GEN) 
Newell, A. et si. (1971), Speech Understanding Systems , 
Advanced Research Projects Agency Report, 1971. 

Also published by North Holland Publishing Company, 
Amsterdam, The Netherlands, 1973. (LANG) 

Newell, A. snd Shaw, J. C. (1957), "Programming the 
Logic Theory Machine," Proc. West. Joint Comp. 

Conf, , 230-240, 1957. (SYS) 

Newell, A., Shaw, J. C. „ and Simon, Herbert (1957), 
"Empirical Explorations with the Logic Theory 
Machine: A Case Study in Heuristics," Proc. 

Wegtern Joint Comp. Conf. 1957 , 218-239. Also In 


Computers and Thought , E. A. Felgenbaum and J. 
Feldman, eds., 109-133, McGraw-Hill Book Company, 
New York, 1963. (DEO, PSYC) 

Newell, A., Shaw, J. C. , and Simon, H. A. (1958a), 
"Elements of s Theory of Human Problem Solving," 
Psychological Review , Vol. 65, 151-166, 1958. 

(PSYC) 

Newell, A., Shaw, J, , and Simon, H. (1958b), "Chess 
Playing Programs and the Problem of Complexity," 

IBM J. Rea. Develop. , Vol. 2, 320-335, Oct. 1958. 
Reprinted In Computers and Thought , Felgenbaum and 
Feldman, eds., 39-70, McGraw-Hill Book Company, 

New York, 1963. (CAME) 

Newell, A., Shaw, J, C. , and Simon, H. A, (1960), 
"Report on a General Problem-Solving Program for 
a Computer," Information Processing: Proc. Inti. 

Conf, Information Processing , 256-264, UNESCO, 

Paris. Also printed In Computers and Automation , 
July 1959. (DED, PSYC, TP) 

Newell, A. and Simon, H. A. (1956), "The Logic Theory 
Machine: A Complex Information Processing System," 

IRE Trans, on Information Theory . Vol. IT-2, No, 3, 
61-79. (SEARCH, DED, PSYC, TP) 

Newell, A. and Simon, H. (1961), "CPS, A Program 
that Simulates Human Thought," Lernende Automaton , 
H. Billing, ed. , 109-124, R. Oldenbourg, Munich. 
Reprinted in Computers and Thought , Felgenbaum and 
Feldman, eds., McGraw-Hill Book Company, New York, 
1963. (PSYC) 

Newell, A. and Simon, Herbert A. (1972), Human 
Problem Solving , Prentice-Hall, Englewood Cliffs, 

NJ, 1972. (PSYC) 

Newell, A. and Tonge, F. M. (1960), "An Introduction 
to Information Processing Language V," Comm. ACM , 
Vol. 3, 205-211. (SYS) 

Nllkson, N. J. (1969a), "Searching Problem-Solving 
and Gamc-Playlng Trees for Ulnlmal Cost Solutions," 
Information Processing 68 , Vol. 2, A. J. H. Morrel, 
ed. , 1556-1562, Korth-Holland Publishing Company, 
Amsterdam, 1969. (SEARCH) 

Nilsson, N. J. (1969b), "a Mobile Automaton: An 

Application of Artificial Intelligence Techniques," 
Proc. IJCAI , 509-515, May 1969, (ROB) 

Nilsson, N. J. (1971), Problem-Solving Methods In 
Artificial Intelligence , McGraw-Hill Book Company, 
New York, 1971. (SEARCH-G,* TP-G) 

Norton, L. (1966), "ADEPT— A Heuristic Program for 
Proving Theorems of Group Theory," MAC Tech. Report 
33, thesis, Project MAC, MIT, Cambridge, MA, 1966. 
(TP) 

Papert, S. (1968), "The Artificial Intelligence of 
Hubert L. Dreyfus, A Budget of Fallacies," MIT 
Artificial Intelligence Memo No. 54, Jan. 1968. 
(GEN) 

Papert, Seymour A. (1972), "Teaching Children 
Thinking," Programmed Learning and Educational 
Technology , Vol. 9, No. 5, Sept. 1972. (GEN) 

Paul, R . C. (1971), "Trajectory Control of a Computer 
Arm," Proc. 2d Inti. Joint Conf. on Artificial 
Intelligence , London, England, Sept. 1971. (ROB) 
Paul, R. C. (1973), "Modeling, Trajectory Calculation 
and Servoing of a Computer Controlled Arm," Ph.D. 
dissertation. Comp. Scl. Dept., Stanford Untv., 
Stanford, CA, 1973. (ROB) 

Paxton, William H. and Robinson, Ann E. (1973), "A 
Parser for s Speech Understanding System," Adv . 
Papers 3d Inti. Conf. on Artificial Intelligence , 

. Stanford Unlv., Stanford, CA, Aug. 1973. (LANG) 
Petrick, S. R. (1965), "A Recognition Procedure for 
Transformational Grammars," Ph.D. thesis, MIT, 
Cambridge, MA. (LANG) 


! 


0-21 



Petrlck, S. R. (1971), "Transformational Analysis," 
natural Language Processing , Courant Computer 
Science Symposium 8, 20-21 Dec. 1971, R. Rustln, 
ad., 27-41, Algorithmic* Praaa, New York. (LANG) 
Patrick, S. R. (1973), "Semantic Interpretation in 
th# Request System," Proc. Inti, Conf. on Computa- 
tional Linguistic* , Plaa, Italy, 1973. (LANG) 
Platrzykowakl, T. and Jensen, D. (1972), "A Complete 
Mechanization of Omega-Order Typa Theory," Proc. 

ACM Natl. Conf. , Voi. I, 82-92, 1972. (TP) 

Pingla, K. X. (1969), "Visual Parcaptlon by a 

Computer," Automatic Intarpratation and Classifica- 
tion of Images , A. Graaaalli, ad., 277-284, Acadamic 
Praaa, New York, London, 1969. (VIS) < 

Pingla, X. X,, Slngar, J. , and Wichnan, W. (1968), 
"Computer Control of a Mechanical Arm Through 
Visual Input," Proc, IFIP Cong. 1968 , Vol. 2. (ROB) 
Pingla, X. X. and Tonenbaun, J. U. (1971), "An 

Accommodating Edge Follower," IJCAI-2 , 1971. (VIS) 
Plath, Warren, J. (1973), "Transformational Grammar 
and Transformational Parsing in tha Request System," 
Proc. Inti. Joint Conf. on Computational Linguis- 
tics , Pisa, Italy, 1973. (LANG) 

Pohl, I. (1970), "Heuristic Search Viewed as Path 
Finding in a Graph," Artificial Intelligence , Vol. 

I, 193-204, 1970. (SEARCH) 

Prawltz, D. (1960), "An Improved Proof Procedure," 
Theorla, Vol. 26, 102-139, 1960. (TP) 

Qulllian, 11. R. (1968), "Semantic Memory, " Semantic 
Information Processing , HIT Preaa, Cambridge, 1IA. 
(REP, PSYC, LANG) 

Qulllian, Ross (1969), "The Teachable Language 

Comprahander : A Simulation Program and Theory of 

Language," Comm. ACM , Vol. 12, 459-476, 1969. 

(REP, PSYC, LANG) 

Raphael, B. (1964a), "A Computer Program Which 

'Understands,'" Proc. AFIPS Fall Joint Comp. Conf. , 
577-589, 1964. (DED, LANG) 

Raphael, B. (1964b), "SIR: A Computer Program for 

Semantic Information Retrieval," HIT, June 1964, 
Reprinted in Semantic Information Processing , W. 
Ulnsky, cd., HIT Press, Cambridge, HA, 1968. (DED, 
LANG) 

Raphael, B. (1968), "Programming A Robot," Proc. IFIP 
Cong. 68 , H135-H140 , Edinburgh, 1968. (ROB) 

Raphael, B. et al. (1971), "Research and Applica- 
tions — Artificial Intelligence," Stanford Research 
Institute Final Report, Contract NASV-2164, 

National Aeronautics «nd Space Administration, 

Dec. 1971. (ROD) 

Raphael, □. and Green, C. (1968), "Tho Use of Theorem 
Proving Techniques in Question Answering Systems," 

J. ACM , 169, 1968. (DED, REP) 

Rcboh, R. and Sacerdoti, E. (1973), "A Preliminary 
QLISP Manual," SRI Artificial Intelligence Center 
Tech. Note 81, Stanford Research Institute, Uenlo 
Park, CA, Aug, 1973. (SYS) 

Reddy, D. R. (1967), "Computer Recognition of Con- 
nected Speech,” J. ASA , Vol. 42, 329-347. (LANG) 
Reddy, D. R. et al. (1973), "The Hearsay Speech 
Under* landing System: An Example of the Recogni- 

tion Process," Adv. Papers 3d Inti. Conf. on 
Artificial Intelligence , Stanford Univ. , Stanford, 
CA, Aug. 1973. (LANG) 

Roberta, L. G. (1963), "Machine Perception of Three- 
Dimensional Solids," MIT Lincoln Lab., Lexington, 

MA, Tech. Report No. 315, May 1963. Also in 
Optical and EHc tro-Optical Information Processing , 
J. T. Tippett et al., eda. , MIT Press, Cambridge, 

MA, 1965. (VIS) 

Robinson, J. A. (1965), "A Machine-Oriented Logic 
Based on the Resolution Principle," J . API , Vol. 

12, No. 1, 23-41, Jan. 1965. (DED, TP) 


Robinson, J. A. (1969), "a Note on Mechanizing 

Higher-Order Logic," Machine Intelligence . Vol. 5, 

B. Meltzer and D. Mlchle, #d*., 123-133, Edinburgh 
Univ. Press, Edinburgh, 1969. (TP) 

Rosen, C. A. (1972); "Robots, Productivity, and 
Quality," Proc. ACM Natl. Conf. 1972, Boston, MA, 
Aug. 14-16, 1972. (ROB-G) 

Rosen, C. (1973), "Exploratory Research in Advanced 
Automation," SRI Report to National Science Founda- 
tion, Grant GI-38200X, Stanford Research Institute, 
Menlo Park, CA, Dec. 1973. (ROB) 

Roszak, T. (1972), There the Wasteland Ends; Politics 
and Transcendence in Poat-Induatrlal Society , 
Doubleday, 1972. (GEN) 

Rulif son, J. F., kaldinger, R. J. , and Derksen, J. A. 
(1971), "A Language for Writing Problem- 
Solving Programs," Proc. IFIP . TA-2, 111-113, I97i. 
(REP, SYS) 

Rulifaon, J. F. , Derksen, J. A., and kaldinger, R. J. 
(1972), "QA4: A Procedural Calculus for Intuitive 

Reasoning," Tech. Note 73, Artificial Intelligence 
Center, Stanford Research Institute, Menlo Park, CA, 

1972. (REP, SYS, DED) 

Rumelhart, D. £., Lindsay, P. H., and Norman, D. A. 
(1972), "A Process Model for Long-Term Memory," 
Organization and Memory . E. Tulving and k. Donaldson, 
ed*.. Academic Press, New York, 1972. (PSYC) 
Rumelhart, David £. and Norman, Donald A. (1973), 
"Active Semantic Networks as a Model of Human 
Memory," Adv. Papers 3d Inti. Conf. on Artificial 
Intelligence , Stanford Univ., Stanford, CA, Aug. 

1973. (PSYC) 

Russell, R. (1964), "XALAH— The Game and the Program," 
Stanford Univ. Artificial Intelligence Project Memo 
No. 22, 3 Sept. 1964. (GAME) 

Rustln, R., ed. (1973), Natural Language Processing , 
Algorithmic Press, New York, 1973. (LANG-G) 

Ryder, J. L. (1971), "Heuristic Analysis of Large 
Trees os Generated in the Game of GO," Al Memo 155, 
Artificial Intelligence Project, Stanford Univ,, 
Stanford, CA, 1971. (GAME) 

Sacerdoti, Earl D. (1973), "Planning in a Hierarchy 
of Abstraction Spaces," Adv. Papers 3d Inti. Conf. 
on Artificial Intelligence , Stanford Univ., Stan- 
ford, CA, Aug. 1973. To appear in Artificial 
Intelligence , 1974. (DED) 

Sakai, T. , Nagao, M. , and Knode, T. (1972), "Computer 
Analysis and Classification of Photographs of Human 
Faces," First USA-Japan Comp, Conf. Proc. , Oct. 

1972. (VIS) 

Samuel, A. (1959), "Some Studies in Machine Learning 
Using the Game of CHECXERS," IBM J. Research 
Develp. , Vol. 3, 211-229, 1959. Reprinted in 
Computers and Thought , E. Fei^enbaum and J. Feldman, 
eda., 71-105, McGraw-Hill Book Company, New fork, 
1963. (GAME) 

Samuel, A. L. (1967), "Some Studies in Machine 
Learning Uaing the Game of CHECKERS II— Recent 
Progress," IBM J. Res. Dev. , Vol. 11, 601-617, 

1967. (GAME) 

Sandewall, E. J. (1971), "Representing Natural 
Language Information in Predicate Calculus," 

Machine Intelligence , Vol. 6, B. Meltzer and 0. 
Mlchle, eda., 255-277, Edinburgh Univ. Press, 
Edinburgh. (REP) 

Sandewall, E. J. (1972a), "Formal Methods in the 

Design of Question-Answering System," J. Artificial 
Intelligence , 237, 1972. (REP) 

Sandewall, E. J. (1972b), "PCF-2, A First-Order 
Calculus for Expressing Conceptual Information," 
Computer Science Report, Comp. Scl. Dept., Uppsala 
Univ., Uppsala, Sweden. (REP) 


022 





] 


. -J 

. J 



Schank, R. C. (1972), "Conceptual Dependency: A 

Theory of Mature! Language Und ere tend ins," Cognitive 
Psychology , yol. 3, 552-631, 1972. (PSYCH, LANG) 
Schenk, R. C. (1973), "The Fourteen Primitive Actions 
end Thel* Inferences," Stanford AIM-183 Comp. Set. 
Dept., Stanford Univ., Stanford, CA, 1973. (REP, 
LANG, PSYC) 

Schenk, R. C. et el. (1972), "Print tive Concepts 
Underlying Verbs of Thought," Stanford AIM-162, 

Comp. Set. Dept., Stanford Univ., Stanford, CA, 

1973. (REP, LANG, PSYC) 

Schenk, t. C. end Colby, It. (1973), Computer Models 
of Thought and Language . W. H. Freeman a Co., San 
Francisco, 1973. (LANG-G, PSYC-G) 

Schenk, R. C., Goldman, Nell, Rieger, Charles J., 
and Rlesback, Chris (1973), "MARGIE: Memory, 

Analysis, Response Generation, and Inference on 
English," Adv. Papers 3d Inti. Cont. on Artificial 
Intelligence , Stanford Univ., Stanford, CA, Aug. 

1973. (LANG) 

Schank, R. C. , Tesler, L. , and Weber, S. (1970), 
"SPINOZA II: Conceptual Case-Based Natural 

Language Analysis," Memo AIM-109, Stanford Artifi- 
cial Intelligence Project, Stanford Univ., Stanford, 
CA, 1970. (LANG) 

Sche Inman, V. D. (1969), "Design of a Computer- 
Controlled Manipulator," thesis, Dept, of M.E. , 
Stanford Univ., Stanford, CA. Available as Stan- 
ford AIM-92, June 1969. (ROB) 

Shannon, C. (1950), "Prograawing a Digital Computer 
for Playing Chess," Philosophy Magazine , Vol. 41, 
356-375, Mar. 1950. Reprinted in The World of 
Mathematics . Vol. 4, J. R. Newman, ed., Simon and 
Schuater, New York, 1954. (GAME) 

Shirai, Y. (1972), "A Heterarchical Program for 

Recognition of Polyhedra," Memo No. 263, Artificial 
Intelligence Laboratory, HIT, Cambridge, UA. (VIS) 
Shirai, Y. (1973), "A Context Sensitive Line Finder 
for Recognition of Polyhedra," Artificial Intelli- 
gence . Vol. 4, No. 2, 95-119, Summer 1973. (VIS) 
Shortliffe, E. H. et si. (1973), "An Artificial 

Intelligence Program to Advise Physicians Regarding 
Antimicrobial Therapy," Computers and Biomedical 
Research , Vol. 6, 544-560, 1973. (AIDS) 

Slawons, R. F. (1965), "Answering English Questions 
by Computer: A Survey," Comm. ACM , Vol. 8, 53-70, 

1965. (LANG-G) 

Simmons, R. F. (1969), "Natural Language Question- 
Ansaerlng Systems: 1969," Comm. AQ4 , Vol. 13, 

15-30, 1970. (LANG-G) 

Simmons, Robert F. (1973), "Semantic Networks: Their 

Computation and Use for Understanding English 
Sentences," Computer Models of Thought and Language , 
K. U. Colby and Roger Schank, eds,, W. H. Freeman 
and Company, San Francisco, CA, 1973. (LANG) 
Simmons, Robert F. , Klein, S., and McConlogue, D. 

(1964), "indexing and Dependency Logic for Answering 
English Questions," American Documentation . Vol. 15, 
NO. 3, 196-204, 1964. (LANG) 

Simon, H. A. (1963), "Experiments with a Heuristic 
Compiler," J, ACU, Vol. 10, No. 4, Oct. 1963, 

(PROG) 

Simon, H. A. (1969), The Sciences of the Artificial , 
MIT Press, Cambridge, MA. (GEN) 

Simon, H. A. (1972), "The Heuriatic Compiler," 
Representation and Meaning , H. A. Simon ana L. 
Siklossy, eds., Prentice-Hall, Englewood Cliffs, 

NJ, 1972. (PROG) 

Simon, H. A. and Felgenbaum, E. A. (1964), "An 
Information-Processing Theory of Some Effects of 
Similarity, Familiarization, and Mean lngfulneaa in 
Verbal Learning," J. Verbal Learning and Verbal 
Behavior , Vol. 3, 385-396, (PSYC) 


Slagle, J. (1961), "A Computer Program for Solving 
Problems la Freshman Calculus (SAINT)," Lincoln 
Laboratory Report 5G-001, May 1961. (SEARCH, AIDS) 
Slagle, J. R, (1963), "A Heuristic Program that 
Solves Symbolic Integration Problems In Freshman 
Calculus," J. ACM. Vol, 10, No. 4, 507-520, Oct. 

1963. Also In Computers and Thought. X. Felgenbaum 
and J. Feldman, eds., 191-203, McGraw-Hill Book 
Company, New York, 1963. (AIDS, SEARCH) 

Slagle, J. (1965), "Experiments with a Deductive 
Question- Answering Program," Comm. ACM . Vol. 8, 
792-798, Dec. 1965, (DTD) 

Slagle, J. R. (1967), "Automatic Theorem Proving with 
Rename able and Semantic Reeolutlon," J. ACM . Vol. 

14, 687-697, 1967. (TP) 

Slagle, J, R. (1971), Artificial Intelligence; The 
Heuriatic Programming Approach . McGraw-Hill Book 
Company, New York, 1971. (GEN, GAME-G, SEARCH-G) 
Slagle, J. R. (1970), "Heuristic Search Programs," 
Theoretical Approaches to Hon-Numerlcal Problsm 
Solving . R. Banerjl and M. I. Mesarovic, eds., 
246-273, Spr Inge r-Ver lag, New York, 1970. (SEARCH) 
Slagle, J. R. and Dixon, J. (1969), "Experleenta with 
Some Programs that Search Game Treea," J. ACM . Vol. 
16, No. 2, 189-207, Apr. 1969. (GAME, SEARCH) 
Solomonoff, R. (1966), "Some Recent Work In Artificial 
Intelligence," Proc, IEEE. Vol. 54, No. 112, Dec. 
1966. (GEN) 

Sperling, G. (I960), "The Information Available in 
Brief Visual Presentations. " Psychological Mono- 
graphs . Vol. 74. (PSYC) 

Srldharan, N. S. (1971), "An Application of Artificial 
Intelligence to Organic Chemical Synthesis," thesis. 
State Univ. of New York at Stony Brook, New York, 

July 1971. (AIDS) 

Srldharan, N. S. et al. (1973a), "A Heuristic Program 
to Discover Syntheses for Complex Organic Molecules," 
Stanford Artificial Intelligence Memo 205, Stanford 
Univ., Stanford, CA, June 1973. (AIDS) 

Srldharan, N. S. (1973b), "Search Strategies for the 
Taak of Organic Chemical Synthesis," Adv. Papers 3d 
lntl. Conf. on Artificial Intelligence , Stanford 
Univ., Stanford, CA, Aug. 1973. (AIDS) 

Sternberg, S. (1966), "High Speed Scanning In Human 
Memory," Science . Vol. 153, 652-654. (PSYC) 

Suasman, G. J. (1973), "A Computational Model of Skill 
Acquisition," Tech. Note AI TR-297, Artificial 
Intelligence Laboratory, MIT, Cambridge, MA, Aug, 
1973. (DED, PROG) 

Suasman, G. J. and McDermott, D. V. (1972), "From 
PLANNER to CONN IYER— A Genetic Approach, " Proc. 

AF1PS FJCC , Vol. 41, 1171-1180, 1972. (SYS) 
Teitelman, W. (1969), "Toward a Programming Labora- 
tory," Proc. lat Inti. Joint Conf. on Artificial 
Intelligence , Washington, D.C. , 1969. (PROG) 
Teitelman, W.- (1972a), "Do What I Mean," Cowputera 
and Automation , Apr. 1971. (PROG, SYS) 

Teitelman, W. (1972b), "Automated Programming— The 
Programmer* a Assistant," Proc, Tall Joint Cowp. 

Conf. , Dec. 1972. (PROG, SYS) 

Teitelman, W. (1973), "CLISP— Conversational LISP," 

Adv. Papers 3d lntl. Conf. on Artificial Intelli- 
gence , Stanford Univ,, Stanford, CA, Aug. 1973. 

(PROG, SYS) 

Teitelman, W. (1974), INTERLISP Reference Manual , 

Xerox and Bolt, Beranek and Newman. Copies 
available from Xerox Palo Alto Research Center, 

Palo Alto, CA. (SYS) 


D-23 



Tenenbaum, J. M. (1973), "On Locating Objects by 
Thalr Distinguishing Features In Multlsenaory 
Images," SRI Artificial Intelligence Center Tech. 
Note 84, Stanford Research Institute, Hanlo Park, 
CA, Sapt. 1973. To appear In Computer Graphics 
and Image Processing , 1974. (VIS) 

Tenenbaum, J. m; at al. (1974), "An Interactive 
Facility for Scene Analysis Research,** SRI Artifi- 
cial Intelligence Canter Tech. Rote. 87, Stanford 
Research Institute, Menlo Park, CA, 1974. (VIS) 
Thompson, F. B. (1966), "English for the Computer," 
Proc. AF1PS 1968 Fall Joint Comp. Conf. , Vol. 29, 
349-356, Spartan Books, Raw York, (LANG) 

Thompson, F. B. et al. (1969), "REL: A Rapidly 

Extensible Language System," Proc. 24th Natl. ACM 
Conf. , 1969. (LANG) 

Thorne, J. , Bratley, P. , and Dewar, H. (1968), "The 
Syntactic Analysis of English by Machine," Machine 
Intelligence , Vol* 3, D. Michle, ed. , American 
Elsevier Publiahing Company, New York, 1968. 

(LANG) 

Tinbergen, N. (1951), The Study of Instinct , 

Clarendon Press, Oxford, 1951, (PSYC) 

Tonga, F. (1961), A Heuristic Program for Aaaembly 
Line Balancing , Prentice Hall, 1961. (AIDS) 

Turing, A. M. (1949), "Checking a Large Routine," 
Report of a Conference on High Speech Automatic 
Calculating-Machines, UcLennon Laboratory, Unlv, 
of Toronto, Canada. (PROG) 

Turing, A. U. (1950), "Computing Machinery and Intel- 
ligence," Mind , Vol. 59, 433-460, Oct. 1950. Re- 
printed In Computers and Thought , 11-35, E. 
Felgenbaum and J. Feldman, eds. , McGraw-Hill Book 
Company, New York, 1963. (GEN) 

Vicens, P. (1969), "Aspects of Speech Recognition 
by Computer," Report CS-127, Ph.D. thesis, Comp. 
Sci. Dept., Stanford Unlv., Stanford, CA. (LANG) 
Waldlngcr, R. J. and Lee, R. C. T. (1969), "PROW: A 

Step Toward Automatic Program Writing," Proc. Inti. 
Joint Conf, on Artificial Intelligence , 241-252, 
1969. (PROG) 

Waldlnger, R. J. and Levitt, K. N. (1973), "Reasoning 
About Programs," Tech. Note 86, SRI Artificial 
Intelligence Center, Oct. 1973, Stanford Research 
Institute, Menlo Park, CA. To Appear In Artificial 
Intelligence , 1974. (PROC) 

Walker, Donald E. , ed. <1964), English Preproceaaor 
Manual , The Mitre Corporation, Bedford, UA, 1964 
(SR-132). (LANG) 

Walker, Donald E. (1973), "Speech Understanding, 
Computational Linguistics, and Artificial Intelli- 
gence," Tech. Note C5, SRI Artificial Intelligence 
Center, Stanford Research Institute, Menlo Park, 

CA, Aug. 1973. (LANG) 

Waltz, D. G. (1972), "Generating Semantic Descrip- 
tions from Drawings of Scenes with Shadows," AZ 
TR-271, Artificial Intelligence Laboratory, MXT, 
Aug. 1972. (VIS) 

Waterman, D. A. (1970), "Generalization Learning 
Techniques for Automating the Learning of Heuris- 
tics," Artificial Intelligence , Vol. I, Noa. 1 
and 2, Spring 1970. (GAME) 

Waterman, D. A. and Newell, A. (1971), "Protocol 
Analysis as a Task for Artificial Intelligence," 
Artificial Intelligence , Vol. 2, Nos. 2 and 3, 
285-318, 1971. (AIDS) 

Waterman, D. A. and Newell, A. (1973), "PAS-II: An 

Interactive Task-Free Version of an Automatic 
Protocol Analysis System," Adv. Papers 3d Inti. 
Conf. on Artificial Intelligence , Stanford Unlv., 
Stanford, CA, Aug. 1973. (AIDS) 
wegbreit, Ben (1973), "Heuristic Methods for 
Mechanically Deriving Inductive Assertions," 

Adv. Papers 3d Inti. Conf. on Artificial Intelli- 


gence , Stanford Unlv., Stanford, CA, Aug. 1973. 

(PROG) 

Velaaaan, C. (1967), LISP 1.5 Primer . Dickenson 
Preaa, 1987. (SYS) 

Velzenbaum, J. (1966), " ELIZA — A Computer Program 
for the Study of Natural Language Communication 
Between Man and Machine,** Comm. ACM, Vol. 9, 36- 
45, 1966. (LANG) 

Weixenbaum, J. (1972), **0n the Impact of the Computer 
on Society,** Science. Vol. 176, No. 609, 1972, 

(GEN) 

West, J. D. (1967), **A Heuristic Model for Scheduling 
Large Projects with Limited Resources," Management 
Science , Vol. 13B, 359-377. (AIDS) 

Wlnograd , T. (1971), "Procedures as Representation 
for Data in a Computer Program for Understanding 
Natural Language," Tech. Report Al TR-17, MIT, 
Cambridge, UA, 1971. Published as Understanding 
Natural Language , Academic Preaa, New York, 1972. 
(REP, DED, LANG) 

Winston, P. H. (1970), "Learning Structural Descrip- 
tions from Examples," Tech. Report Al TR-231, 
Artificial Intelligence Laboratory, MIT, Cambridge, 
MA, 1970. (REP, VIS) 

Winston, P. H. (1972), "The MXT Robot," Machine 
Intelligence , Vol. 7, 431-463, B. Meltzer and D. 
Michle, eds. , American Elsevier Publishing Company, 

1972. (ROB, VIS) 

Woods, W. A. (1970), "Transition Netmork Grammars for 
Natural Language Analysis," Comm. ACM , Vol. 13, 
591-606, 1970. (LANG) 

Woods, W. A. (1973), "An Experimental Parsing System 
for Transition Network Grammars," Natural Language 
Processing , R. Rustin, ed., 111-154, Algorlthmlcs 
Press, New York, 1973. (LANG) 

Woods, W. A., Kaplan, R. M. , Nash-Webber, G. (1972), 
"The Lunar Science Natural Language Information 
System: Final Report," BBN Report No. 2376, Bolt, 

Bcranek and Newman, Inc., Cambridge, MA, June 1972. 
(LANG) 

Woods, W. A, and Makhoul, J. (1973), "Mechanical 

Inference Problems in Continuous Speech Understsnd- 
ing," Adv. Papers 3d Inti. Conf, on Artificial 
Intelligence , Stanford Unlv., Stanford, CA, Aug. 

1973. (LANG) 

Wooldridge, D. (1963), "An Algebraic Simplify Program 
In LISP," Artificial Intelligence Project Memo No. 
11, Stanford Unlv., Stanford, CA, Dec. 1963* 

(AIDS) 

Wos, L. T. , Carson, D. G., and Robinson, G. A. (1964), 
"The Unit Preference Strategy In Theorem Proving," 
Proc. AFIPS . Fall 1964, Vol. 25, 615-621, 

Spartan Books, Washington, D.C. (TP) 

Wos, L. T. , Robinson, G. , and Carson, D. (1963), 
"Efficiency and Completeness of the Set of Support 
Strategy in Theorem-Proving," J, API , Vol. 12, No. 

4, 536-341, Oct. 1963. (TP) 

Yaklmovsky, Yoram and Feldman, Jerome A. (1973), "A 
Semantics-Baaed Decision Theory Region Analyzer," 
Adv. Papers 3d Inti. Conf. on Artificial Intelli- 
gence , Stanford Unlv., Stanford, CA, Aug. 1973. 

(VIS) 

Yates, R. , Raphael, 8., and Hart, T. (1970), "Resolu- 
tion Graphs," Artificial Intelligence, Vol. 1, No. 

4, 1970. (TP) 

Zobrlst, A. (1969), "A Model of Visual Organization 
for the Game of 00," Proc. AFIPS, Spring 1969 , 
103-112. (GAME) 

Zobrlst, A. and Carlson, F. , Jr. (1973), "An Advice- 
Taking Chess Computer," Scientific American, June 
1973. (GAME) 

Zeicky, Arnold M. et al. (1963), "The Mitre Syntactic 
Analysis Procedure for Transformational Grammars, 
Proc. AFIPS. Fall 1965. Vol. 27, 317-326. (LANG) 


D-24 



Appendix E 


Edward A. Feigenbaum. “The art of artificial intelligence - Themes and case studies 
of knowledge engineering,” pp 227-240, in the Proceedings of the National Computer 
Conference - 1978 , copyrighted 1978. Reproduced by permission of AFIPS Press. 




i 


& ' 








Appendix E 


The art of artificial intelligence — Themes and case studies of 
knowledge engineering 


by EDWARD A. FEIGENBAUM 

Stanford Umrtmry 
UamtonL, Cjhianm 

INTRODUCTION— AN EXAMPLE 

This paper wdl examine emerging themes of knowledge en- 
gineering, illustrate then! with case studies drawn from the 
work of the Stanford Heuristic P rogramming Project, and 
discuss general issues of knowledge engineering art and 
practice. 

Let me begin with an example new to our workbench: a 
system called PUFF, the early fruit of a collaboration be- 
tween our project and a group at the Pacific Medical Center 
(PMC in San Francisco.* 

A physician refers a patient to PMC*s pulmonary function 
testing lab for diagnosis of possible pulmonary function dis- 
order. For one of the tests, the patient inhales and exhales 
a few times in a tube connected to an instrument/computer 
combination. The instrument acquires data on flow rates 
and volumes, the so-called flow- volume loop of the patient's 
lungs and airways. The computer measures certain param- 
eters of the cur/e and presents them to the diagnostician 
(physician or PUFF) for interpretation. The diagnosis is 
made along these lines: normal or diseased; restricted lung 
disease or obstructive airways disease or a combination of 
both; the severity; the likely disease typc(s) (e.g., emphy- 
sema, bronchitis, etc.); and other f acton important for di- 
agnosis. 

PUFF is given not only the measured data but also certain 
items of information from the patient record, e.g., sex, age. 
Dumber of pack-years of cigarette smoking. The task of the 
PUFF system is to infer a diagnosis and print it out in 
English in the normal medical summary form of the inter- 
pretation expected by the referring physician. 

Everything PUFF knows about pulmonary function di- 
agnosis is contained in (currently) 35 rules of the IF. . . 
THEN. . . form. No textbook of medicine currently records 
these rules. They constitute the partly-public, partly-private 
knowledge of an expert pulmonary physiologist at PMC, and 
were extracted and polished by project engineers working 
intensively with the expert over a period of time. Here is an 
example of a PUFF rule (the unexplained acronyms refer to 
various data measurements): 


* Or. J. Oftbara. Or. fL TiRu. Jofcn Kuas. D tan* McGaa*. 


RULE 31 
IF: 

1) The severity of obstructive airways 
disease of the patient is greater than or 
equal to mild, and 

2) The degree of diffusion defect of the 
patient is greater than or equal to mild, 
and 

3) The tic (body box) observed/predicted of 
the patient is greater than or equal to 1 10 
and 

4) The observed-predicted difference in 
rv/ tic of the patient is greater than or 
equal to 10 

THEN: 

1) There is strongly suggestive evidence 
(.9) that the subtype of obstructive airways 
disease is emphysema, and 

2) It is definite (1.0) that "OAD. 

Diffusion Defect, elevated TLC, and elevated 
RV together indicate emphysema.’* is one of 
the findings. 

One hundred cases, carefully chosen to* span the variety 
of disease states with sufficient exemplary information for 
each, were used to extract the 55 rules. As the knowledge 
emerged, it was represented in rule form, added to the sys- 
tem and tested by running additional cases. The expert was 
sometimes surprised, sometimes frustrated, by the occa- 
sional gaps and inconsistencies in the knowledge, and the 
incorrect diagnoses that were logical consequences of the 
existing rule set. The interplay between knowledge engineer 
and expert gradually expanded the set of rules to remove 
most of these problems. 

As cumulation of techniques in the art demands and al- 
lows, a new tool was not invented when an old one would 
do. The knowledge engineers pulled out of their toolkit a 
version of the MYCIN system (to be discussed later), with 
the rules about infectious diseases removed, and used it as 
the inference engine for the PUFF diagnoses. Thus PUFF, 
like MYCIN, is a relatively simple backward-chaining infer- 
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cnee system. It seek* a valid line-of- reasoning based on its 
rules and rooted in the instrument and patient data. With a 
little more work at fitting some existing tools together, 
PUFF will be able to explain this line-of-reasoning, just as 
MYCIN does. 

As it is, PUFF only prints out the final interpretation, of 
which the following is an example: 

PATIENT DATA; 

The degree of dyspnea: MODERATELY-SEVERE 
The severity of coughing: MILD 
Sputum production MODERATELY-SEVERE 
• The number of pack- yean of smoking: 43 
referral diagnosis: BRONCHITIS 
IVC/IVC-predicted: 80 
RV/RV-predicted: 191 
FVOFVC* predicted: 37 
TLC(body boxjobserv ed/predictcd: 127 
Predicted FEVj/FVC 83 
TLC(DLCO)ob$erved/predicUd: 83 
FEVt/FVC ratio: 30 
RVm-C Observed- Predicted: 21 
MMF/MMF-predicted: 19 
the DLCO/D LCO-predicted: 43 
The slope (F50L_obs-F25_obsyFVC-obs: 19 

DECREE OF OBSTRUCTIVE AIRWAYS DISEASE: 

OAD degree by SLOPE: (MODERATELY-SEVERE 
700) 

OAD degree by MMF: (SEVERE 900) 

OAD degree by FEVl; (MODERATELY-SEVERE 
700) 

FINAL OAD DEGREE: (MODERATELY-SEVERE 
910) (SEVERE 900) 

No conflict. Final degree: 

(MODERATELY-SEVERE 910) 

INTERPRETATION: 

Obstruction is indicated by curvature of 
the flow-volume loop. 

Forced Vital Capacity is normal and peak 
flow rates are reduced, suggesting 
airway obstruction. 

Flow rate from 25-75 of expired volume is 
reduced, indicating severe airway 
obstruction. 

OAD, Diffusion Defect, elevated TLC, and 
elevated RV together indicate emphysema. 

OAD, Diffusion Defect, and elevated RV 
indicate emphysema. 

Change in expired flow rates following 
bronchodilanon shows that there is 
reversibility of airway obstruction. 

The presence of a productive cough is an 
indication that the OAD is of the 
bronchitic type. 

Elevated lung volumes indicate 
overinilation. 

Air trapping is indicated by the elevated 


difference between observed and predicted 
RV/TLC ratios. 

Improvement in airway resistance indicates 
some reversibility of airway 
Airway obstruction is consistent with the 
patient’s smoking history. 

The airway obstruction accounts for the 
patient’s dyspnea. 

Although broochodilators were not 
useful in this one case, prolonged use may 
prove to be beneficial to the patient. 

The reduced diffusion capacity indicates 
airway obstruction of the mixed 
bronchitic and emphysematous types. 

Low diffusing capacity indicates toss of 
alveolar capillary surface. 

Obstructive Airways Disease of mixed types 

150 cases not studied during the knowledge acquisition 
process were used for a test and validation of the rule set. 
PUFF inferred a diagnosis for each. PUFF-produced and 
expert-produced interpretations were coded for statistical 
analysis to discover the degree of agreement. Over various 
types of disease states, and for two conditions of match 
between human and computer diagnoses ("same degree of 
severity” and "within one degree of severity”), agreement 
ranged between approximately 90 percent and 100 percent. 

The PUFF story is'just beginning and will be told perhaps 
at a later NCC. The surprising punchline to my synopsis is 
that the current state of the PUFF system as described 
above was achieved in less than 50 hours of interaction with 
the expert and less than 10 man-weeks of effort by the 
knowledge engineers. We have learned much in the past 
decade of the an of engineering know ledge- based intelligent 
agents! 

In the remainder of this essay, I would like to discuss the 
route that one research group, the Stanford Heuristic Pro- 
gramming Project, has taken, illustrating progress with case 
studies, and discussing themes of the work. 

ARTIFICIAL INTELLIGENCE <fe KNOWLEDGE 
ENGINEERING 

The dichotomy that was used to classify the collected 
papers in the volume Computers and Thought still charac- 
terizes well the motivations and research efforts of the AI 
community. First, there are some who work toward the 
construction of intelligent artifacts, or seek to uncover prin- 
ciples, methods, and techniques useful in such constriction. 
Second, there are those who view artificial intelligence as 
(to use Newell’s phrase) "theoretical psychology.” seeking 
explicit and valid information processing models of human 
thought. 

For purposes of this essay, 1 wish to focus on the moti- 
vations of the first group, these days by far the larger of the 
two. I label these motivations "the intelligent agent view- 
point” and here is my understanding of that viewpoint: 

"The potential uses of computers by people to accom- 
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plish tasks can be *ooe-dimensionalized’ into a spectrum 
representing the nature of instruction that oust be given 
the computer to do its job. Call it the WHAT-to*HOW 
spectrum. At one extreme of the spectrum, the user sup* 
plies his intelligence to instruct the machine with precision 
exactly HOW to do his job, step-by-step. Progress in 
Computer Science can be seen as steps away from the 
extreme ‘HOW’ point on the spectrum: the familiar pan- 
oply of assembly languages, subroutine libraries, compil- 
ers, extensible languages, etc. At the other extreme of 
the spectrum is the user with his real problem (WHAT be 
wishes the computer, as his instrument, to do for him). 
He aspires to communicate WHAT he wants done in a 
language that is comfortable to him (perhaps English); via 
communication modes that are convenient for him (in- 
cluding perhaps, speech or pictures); with some general- 
ity, some vagueness, imprecision, even error, without 
having to lay out in detail all necessary subgoals for ad- 
equate performance — with reasonable assurance that he 
is addressing an intelligent agent that is using knowledge 
of his world to understand his intent, to till in his vague- 
ness, to make specific his abstractions, to correct his 
errors, to discover appropriate subgoals, and ultimately 
to translate WHAT he really wants dooe into processing 
steps that define HOW it shall be dooe by m real computer. 
The research activity aimed at creating computer pro- 
grams that act as “intelligent agents’* near the WHAT 
end of the WHAT-To-HOW spectrum can be viewed as 
the long-range goal of Al research.’* (Feigenbaum, 1974) 

Our young science is still more an than science. An: “the 
principles or methods governing any craft or branch of learn- 
ing.'* Arc “skilled workmanship, execution, or agency.'* 
These the dictionary teaches us. Knuth tells us that the 
endeavor of computer programming is an an, in just these 
ways. The an of constructing intelligent agents is bath pan 
of and an extension of the programming an. It is the an of 
building complex computer programs that represent and rea- 
son with knowledge of the world. Our an therefore lives in 
symbiosis with the other worldly arts, whose practitioners — 
experts of their an — hold the knowledge we need to con- 
struct intelligent agents. In most “crafts or branches of 
learning" what we call “expertise" is the essence of the an. 
And for the domains of knowledge that w« touch with our 
an, it is the "rules of expertise" or the rules of “good 
judgment” of the expen practitioners of that domain that 
we seek to transfer to our programs. 

Lessons of the past 

Two insights from previous work are pertinent to this 
essay. 

The tint concerns the quest for generality and power of 
the inference engine used in the performance of intelligent 
acts (what Minsky and Papen [see Goldstein and Pa pen, 
1977] have labeled “the power strategy”). We must hypoth- 
esize from our experience to date that the problem solving 
power exhibited io an intelligent agent's performance is pri- 


marily a consequence of the specialist’s knowledge em- 
ployed by the agent, and only very secondarily related to 
the generality and power of the inference method employed. 
Our agents must be knowledge-rich, even if they are meth- 
ods-poor. In 1970, reporting the first major summary-of- 
results of the DENDRAL program (to be discussed later ), 
we addressed this issue as follows: 

**. . . general problem-solvers are too weak to be used 
as the basis for building high-performance systems. The 
behavior of the best general problem-solvers we know, 
human problem-solvers, is observed to be weak and shal- 
low, except in the areas in which the human problem- 
solver is a specialist. And it is observed that the transfer 
of expertise between specialty areas is slight. A chess 
master is unlikely to be an ex pen algebraist or an expen 
mass spectrum analyst, etc. In this view, the expen is the 
specialist, with a specialist’s knowledge of his area and a 
specialist’s methods and heuristics.” (Feigenbaum, Buch- 
anan and Lederberg, 1971, p. 1ST) 

Subsequent evidence from our laboratory and all others 
has only confirmed this belief. 

Ai researchers have dramatically shifted their view on 
generality and power in the past decade. In 1967, the can- 
onical question about the DENDRAL program was: “It 
sounds like good chemistry, but what does it have to do 
with Al?” In 1977, Goldstein and Papen write of a paradigm 
shift in Al: 

“Today there has been a shift in paradigm. The fun- 
damental problem of understanding intelligence is not the 
identification of a few powerful techniques, but rather the 
question of how to represent large amounts of knowledge 
in a fashion that permits their effective use and interac- 
tion.” (Goldstein and Papen, 1977). 

The second insight from past wort concerns the nature of 
the knowledge that an expen brings to the performance of 
a task. Experience has shown us that this knowledge is 
largely heuristic knowledge, experiential, uncertain — mostly 
“good guesses” and ”good practice,” in lieu of facts and 
rigor. Experience has also taught us that much of this knowl- 
edge is private to the expen, not because he is unwilling to 
share publicly how he performs, but because he is unable. 
He knows more than he is aware of knowing. [Why else is 
the Pfa.D. or the Internship a guild-like apprenticeship to a 
presumed “master of the craft?*' What the masters really 
know is not written in the textbooks of the masters.] But 
we have learned also that this private knowledge can be 
uncovered by the careful, painstaking analysis of a second 
party, or sometimes by the expen himseif. operating in the 
context of a Urge number of highly specific performance 
problems. Finally, we have learned that expertise is multi- 
faceted, that the expen brings to bear many and varied 
sources of knowledge in performance. The approach to cap- 
turing his expertise must proceed on many fronts simulta- 
neously. 



230 


National Computer Conference, 1973 


The knowledge engineer 

The knowledge engineer is that second party just dis- 
cussed. She works intensively with an expert to acquire 
domain-specific knowledge and organize it for use by a pro- 
gram. Simultaneously she is matching the tools of the Al 
workbench to the cask at hand — program organizations, 
methods of symbolic inference, techniques for the structur- 
ing of symbolic information, and the like. If the tool fits, or 
nearly fits, she uses it. If not, necessity mo then AI inven- 
tion, and a new tool gets created. She builds the early ver- 
sions of the intelligent agent, guided always by her Intent 
that the program eventually achieve expert levels of per- 
formance in the task. She refines or reconcepcualizes the 
system as the increasing amount of acquired knowledge 
causes the AI tool to “break** or slow down intolerably. 
She also refines the human interface to the intelligent agent 
with several aims: to make the system appear “comforta- 
ble' * to the human user In his linguistic transactions with it; 
to make the sy sum's inference processes understandable to 
the user, and to make the assistance controllable by the user 
when, in the conux: of a real problem, he has an insight 
that previously was not elicited and therefore not incorpo- 
rated. 

In the next section, 1 wish to explore (in summary form) 
some case studies of the knowledge engineer's art. 

CASES FROM THE KNOWLEDGE ENGINEER’S 

WORKSHOP 

I will draw material for this section from the work of my 
group at Stanford. Much exciting work in knowledge engi- 
neer ng is going on elsewhere. Since my intent is not to 
survey literature but to illustrate themes, at the risk of ap- 
pearing parochial I have used as case studies the work I 
know best. 

My collaborators (Professors Lederberg and Buchanan) 
and 1 began a series of projects, initially the development of 
the DENDRAL program, in 1963. We had dual motives: 
first, to study scientific problem solving and discovery, par- 
ticularly the processes scientists do use or should use in 
inferring hypotheses and theories from empirical evidence; 
and second, to conduct this study in such a way that our 
experimental programs would one day be of use to working 
scientists, providing intelligent assistance on important and- 
difficult problems. By 1970. we and our co-workers had 
gained enough experience that we felt comfortable in laying 
out a program of research encompassing work on theory 
formation, knowledge utilization, knowledge acquisition, 
explanation, and knowledge engineering techniques. Al- 
though there were some surprises along the way, the general 
lines of the research are proceeding as envisioned. 


THEMES 

As a road map to these case studies, it is useful to keep 
in mind certain major themes: 


Generation-and-test: Omnipresent in our experiments is the 
“classical" generation-and-test framework that has been 
the hallmark of AI programs for two decades. This is not 
a consequence of a doctrinaire attitude on our part about 
heuristic search, but rather of the usefulness and suffi- 
ciency of the concept. 

Situation^Action Rules: We have chosen to represent the 
knowledge of experts in this form. Making no doctrinaire 
claims for the universal applicability of this representa- 
tion, we nonetheless point to the demonstrated utility of 
the rule-based representation. From this representation 
flow rather directly many of the characteristics of our 
programs: for example, ease of modification of the knowl- 
edge, ease of explanation. The essence of our approach 
is that a rule must capture a “chunk" of domain knowl- 
edge that is meaningful, in and of itself, to the domain 
specialist. Thus our rules bear only a historical relation- 
ship to the production rules used by Newell and Simon 
(1972) which we view as “machine-language program- 
ming" of a recognize act machine. 

Tht Domain^pecifie Knowledge: It plays a critical role in 
organizing and constraining search. The theme is that in 
the knowledge is the power. The interesting actio a arises 
from the knowledge base, not the inference engine. We 
use knowledge in rule form (discussed above), in the form 
of inferentiaUy-rich models based on theory, and in the 
form of rbleaus of symbolic data and relationships (i.c., 
frame-like structures). System processes are made to con- 
form to natural and convenient representations of the do- 
main-specific knowledge. 

Flexibility to modify the knowledge base: If the so-called 
“grain size" of the knowledge representation is chosen 
properly (i.e„ small enough to be comprehensible but 
large enough to be meaningful to the domain specialist), 
then the rule-based approach allows great flexibility for 
aHHirt^ removing, or changing knowledge in the syst em . 

Une-ofrtasoning: A central organizing principle in the de- 
sign of knowledge-based intelligent agents is the mainte- 
nance of a Une-of-reasoning that is comprehensible to the 
domain specialist. This principle is, of course, not a logical 
necessity, but seems to us to be an engineering principle 
of major importance. 

Multiple Sources of Knowledge: The formation and main- 
tenance (support) of the line-of-reasooing usually require 
the integration of many disparate sources of knowledge. 
The representational and inferential problems in achieving 
a smooth and effective integration are formidable engi- 
neering problems. 

Explanation: The ability to explain the tioe-of-reasoning in 
a language convenient to the user is necessary for appli- 
cation and for system development (e.g., for debugging 
and for extending the knowledge base). Once again, this 
is an engineering principle, but very important. What con- 
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ttitutet. M aa explanation'* U ooc a simple concept, and 

considerable thought needs to be given, in each cue, to 

the ttruczuhng of explanation*. 

CASE STUDIES 

la this lection 1 will try to iHustnte these themes with 
venous case studies. 

DBS ORAL: Inferring chemical structures 

nroocw Km 

Begun m 1965, this collaborative project with the Stanford 
Mass Spectrometry Laboratory has become one of the long- 
est-lived continuous efforts in the history of AI (a fact that 
in no small way has contributed to its success). The basic 
framework of generation-and-test and rule-based represen- 
tation has proved rugged and extendable. For us the DEN- 
ORAL system has been a fountain of ideas, many of which 
have found their way, highly metamorphosed, into our other 
projects. For example, our long-standing commitment to 
rule-based representations arose out of our (successful) at- 
tempt to head off the imminent ossification of DENDRAL 
caused by the rapid accumulation of new knowledge in the 
system around 1967. 

Teak 

To enumerate plausible structures (atom-bond graphs) for 
organic molecules, given two kinds of information: analytic 
instrument data from a mass spectrometer and a nuclear 
magnetic resonance spectrometer: and user-supplied con- 
straints on the answers, derived from any other source of 
knowledge (instrumental or contextual) available to the user. 

Bepratotatiooi 

Chemical structures are represented as node-link graphs 
of atoms (nodes) and bonds (links). Constraints on search 
are represented as subgraphs (atomic configurations) to be 
denied or preferred. The empirical theory of mass spectrom- 
etry is represented by a set of rules of the general form: 

Situation: Particular atomic 
configuration 
(subgraph) 

i 

• 

| Probability, P, 

! of occurring 

i 

i 

• 

V 

Action: Fragmentation of the 
particular configuration 
(Breaking links) 


Rules of this form are natural and expressive to mass 
i p o c trom e triita. 


Sketch ef method 

DENDRAL’s inference procedure is a heuristic search 
that takes place in three stages, without feedback: plan- 
geaerate-test. 

“Generate'* (a program called CONGEN) is a generation 
process for plausible structures. Its foundation is a combi- 
natorial algorithm (with mathematically proven properties of 
completeness and non-redundant generation) that can pro- 
duce all the topologically legal candidate structures. Con- 
straints supplied by the user or by the “Plan’* process prune 
and steer the generation to produce the plausible set (i.c., 
those satisfying the constraints) and not the enormous legal 
set. 

“Test" refines the evaluation of plausibility, discarding 
less worthy candidates and rank-ordering the remainder for 
examination by the user. “Test** first produces a “pre- 
dicted" set of instrument data for each plausible candidate, 
using the roles described. It then evaluates the worth of 
each candidate by comparing its predicted data with the 
actual Input data. The evaluation is based on heuristic cri- 
teria of goodness-of-fit. Thus, “test** selects the “best** 
ex planations of the 

“Plan” produces direct (i.c., not chained) inference about 
likely substructure in the molecule from patterns in the data 
that are indicative of the presence of the substructure. (Pat- 
terns in the data trigger the left-hand-sides of substructure 
rules). Though composed of many atoms whose intercon- 
nections are given, the substructure can be manipulated as 
atom-like by “generate." Aggregating many units entering 
into a combinatorial process into fewer higher-level units 
reduces the size of the combinatorial search space. “Plan” 
sets up the search space so as to be relevant to the input 
data. '‘Generate is the inference tactician; “Plan” is the 
inference strategist. There is a separate “Plan” package for 
each type of instrument data, but each package passes sub- 
structures (subgraphs) to “Generate.” Thus, there is a uni- 
form interface between “Plan” and “Generate.” User-sup- 
plied constraints enter this interface, directly or from user- 
assist packages, in the form of substructures. 


Sources of knowledge 

The various sources of knowledge used by the DEN- 
DRAL system are: 

Valences (legal connections of atoms); stable and un- 
stable configurations of atoms; roles for mass spectrom- 
etry fragmentations; roles for NMR shifts; experts’ roles 
for planning and evaluation; use r-iupp lied constraints 
(contextual). 
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Results 

DENDRAL's structure elucidation abilities are, paradox- 
ically, both very general and very narrow. In general, DEN- 
DRAL handles all molecules, cyclic and tree-like. In pure 
structure elucidation under constraints (without instrument 
data), CONGEN is unrivaled by human performance. In 
structure elucidation with instrument data, DENDRAL's 
performance rivals ex pen human performance only for a 
small number of molecular families for which the program 
has been given specialist's knowledge, namely the families 
of interest to our chemist collaborators. I will spare this 
computer science audience the list of names of these fami- 
lies. Within these areas of knowledge-intensive specializa- 
tion, DENDRAL's performance is usually not only much 
faster but also more accurate than expert human perform- 
ance. 

The statement just made summarizes thousands of runs 
of DENDRAL oa problems of interest to our experts, their 
colleagues, and their students. The results obtained, along 
with the knowledge that had to be given to DENDRAL to 
obtain them, are published in major journals of chemistry. 
To date, 25 papers have been published there, under a series 
title " Applications of Artificial Intelligence for Chemical 
Inference: (specific subject) " (see for example, the Bu- 
chanan, Smith, et al., 1976, reference). 

The DENDRAL system is in everyday use by Stanford 
chemists, their collaborators at other universities and col- 
laborating or otherwise interested chemists in industry. 
Users outside Stanford access the system over commercial 
computerfcommunicarioos network. The problems they are 
solving are often difficult and aovei. The British government 
is currently supporting work at Edinburgh aimed at trans- 
ferring DENDRAL to industrial user communities in the 
UK. 

IHmeim 

Representation and extensibility. The representation cho- 
sen for the molecules, constraints, and rules of instrument 
data interpretation is sufficiently close to that used by chem- 
ists in thinking about structure elucidation that the knowl- 
edge base has been extended smoothly and easily, mostly 
by chemists themselves in recent years. Only one major 
reprogramming effort took place in the last 9 yean — when 
a new generator was created to deal with cyclic structures. 

Representation and the Integration of multiple sources of 
knowledge. The generally difficult problem of integrating 
various sources of knowledge has been made easy in DEN- 
DRAL by careful engineering of the representations of ob- 
jects, constraints, and rules. W« insisted on a common lan- 
guage of compatibility of the representations with each other 
and with the inference processes: the language of molecular 
structure expressed as graphs. This leads to a straightfor- 
ward procedure for adding a oew source of knowledge, say, 
for example, the knowledge associated with a new type of 
instrument data. The procedure is this: write rules that de- 
scribe the effect of the physical processes of the instrument 
on molecules using the situatioa=>*ction form with molec- 


ular graphs on both sides; any special inference process 
using these rules must pass its results to the generator only 
(f) in the common graph language. 

It is today widely believed in Al that the use of many 
diverse sources of knowledge in problem solving and data 
interpretation has a strong effect on quality of performance. 
How strong is, of course, domain-dependent, but the impact 
of bringing just one additional source of knowledge to bear 
on a problem can be startling. In one difficult (but not un- 
usually difficult) mass spectrum analysis problem,* the pro- 
gram using its mass spectrometry knowledge alone would 
have generated an impossibly large set of plausible candi- 
dates (over 1.25 million!). Our engineering response to this 
was to add another source of data and knowledge, proton 
NMR. The addition on a simple interpretive theory of this 
NMR data, from which the program could infer a few ad- 
ditional constraints, reduced the set of plausible candidates 
to one, the right structure! This was not an isolated result 
but shoved up dozens of times in subsequent analyses. 

DENDRAL and data. DENDRAL’s robust models (top- 
ological, chemical, instrumental) permit a strategy of find- 
ing solutions by generating hypothetical "correct answers" 
and choosing among these with critical tests. This strategy 
is opposite to that of piecing together the implications of 
each data point to form a hypothesis. We call DENDRAL's 
strategy largely model-driven, and the other data-driven. 
The consequence of having enough knowledge to do model- 
driven analysis is a large reduction in the amount of data 
that must be examined since data is being used mostly for 
verification of possible answers. In a typical DENDRAL 
mass spectrum analysis, usually no more th a n about 15 data 
points out of a typical total of 2^0 points are processed. This 
important point about data reduction and foe us -of- xnen don 
has been d i s cu ssed before by Gregory (1968) and by the 
vision and speech research groups, but is not widely under- 
stood. 

Co n cl u sion. DENDRAL was an early herald of Al's shift 
to the knowledge-based paradigm. It demonstrated the point 
of the primacy of domain-specific knowledge in achieving 
expert levels of performance. Its development brought to 
the surface important problems of knowledge representa- 
tion, acquisition, and use. It showed that, by and large, the 
Al tools of the first decade were sufficient to cope with the 
demands of a complex scientific problem- solving task, or 
were readily extended to handle unforeseen difficulties. It 
demonstrated that ATs conceptual and programming tools 
were capable of producing programs of applications interest, 
albeit in narrow specialties. Such a demonstration of com- 
petence and sufficiency was important for the credibility of 
the Al field at a critical juncture in its history. 

META-DEN DRAL: inferrinf ruUs of mass speerromerry 

Historical note 

The META-DENDRAL program is a case study in auto- 
matic acquisition of domain knowledge. It arose out of our 


* Tt* firm of m acyclic mm vita formuia C3JH45N. 




Tbs Ait of Artificial Imeffigence 


233 


DENDRAL work for two reasons; first, a decision that with 
DENDRAL we had a sufficiently firm foundation on which 
to pursue our long-standing interest in processes of scientific 
theory formation; second, by a recognition that the acqui- 
sition of domain knowledge was the bottleneck problem in 
the building of ippiications-oriented intelligent agents. 


Task 

META-DENDRAL's job is to infer rules of fragmentation 
of molecules in a mass spectrometer for possible later use 
by the DENDRAL performance program. The inference is 
to be made from actual spectra recorded from known mo- 
lecular structures. The output of the system is the set of 
fragmentation rules discovered, summary of the evidence 
supporting each rule, and a summary of contra-indicating 
evidence. User-supplied constraints can also be input to 
force the form of rules along desired lines. 


Representations 

The rules are. of course, of the same form as used by 
DENDRAL that was described earlier. 


Sketch of method 

META-DENDRAL, like DENDRAL. uses the genera- 
tioo-and-test framework. The process is organized in three 
stages; Reinterpret the data and evidence 

(INTSUM); generate plausible candidates for rules (RU- 
LE GEN); test and refine the set of plausible rules (RULE- 
MOD). 

INTSUM; gives every data point in every spectrum an 
interpretation as a possible. (highly specific) fragmentation. 
It then summarizes statistically the “weight of evidence” 
for fragmentations and for atomic configurations that ca u se 
these fragmentations. Thus, the job of INTSUM is to trans- 
late data to DENDRAL subgraphs and bond-breaks, and to 
summarize the evidence accordingly. 

RULE GEN: conducts a heuristic search of the space of 
all rules that are legal under the DENDRAL rule syntax and 
the user-supplied constraints. It searches for plausible rules, 
i.e., those for which positive evidence exists. A search path 
is pruned when there is no evidence for rules of the class 
just generated. The search tree begins with the (single) most 
general rule (loosely put, “anything” fragments from “any- 
thing”) and proceeds level-by-level toward more detailed 
specifications of the “anything.” The heuristic stopping cri- 
terion measures whether a rule being generated has become 
too specific, in particular whether it is applicable to too few 
molecules of the input set. Similarly there is a criterion for 
deciding whether an emerging rule is too general. Thus, the 
output of RULEGEN is a set of candidate rules for which 
there is positive evidence. 

RULE MOD: tests the candidate rule set using more com- 


plex criteria, including the presence of negative evidence. 
It removes redundancies in the candidate rule set; merges 
rules that are supported by the same evidence; tries further 
specialization of candidates to remove negative evidence; 
and tries further generalization that preserves positive evi- 
dence. 


Remits 

META-DENDRAL produces rule sets that rival in quality 
those produced by our collaborating experts. In some tests, 
META-DENDRAL re-created rule sets that we had previ- 
ously acquired from our experts during the DENDRAL proj- 
ect. In a more stringent test involving members of a family 
of complex ringed molecules for which the mass spectral 
theory had not been completely worked out by chemists. 
META-DENDRAL discovered rule sets for each subfamily. 
The rules were judged by experts to be excellent and a paper 
describing them was recently published in a major c h em ical 
journal (Buchanan. Smith, et al. 1976). 

In a test of the generality of the approach, a version of 
the META-DENDRAL program is currently being applied 
to the discovery of rules for the analysis ofnuclear magnetic 
resonance data. 


MYCIN and TE1RESIAS: Medical diagnosis 

Historical note 

MYCXN originated in the Ph.D. thesis of E. Shortliffe 
(now Shortliffe, M.D. as well), in collaboration with the 
Infectious Disease group at the Stanford Medical School 
(Shortliffe, 1976). TE IRES LAS, the Ph.D. thesis work of R. 
Davis, arose from issues and problems indicated by the 
MYCIN project but generalized by Davis beyond the bounds 
of medical diagnosis applications (Davis, 1976). Other 
MYCIN-related theses are in progress. 


Tasks 

The MYCIN performance task is diagnosis of blood in- 
fections and meningitis infections and the recommendation 
of drug treatment. MYCIN conducts a consultation (in Eng- 
lish) with a physician-user about a patient case, constructing 
lines-of-reaaoning leading to the diagnosis and treatment 
plan. 

The TEIRESIAS knowledge acquisition task can be de- 
scribed as follows; 

In the context of a particular consultation, confront the 
expert with a diagnosis with which he does not agree. Lead 
him systematically back through the line-of-reasoning that 
produced the diagnosis to the point at which he indicates 
the analysis went awry. Interact with the expert to modify 
offending rules or to acquire new rules. Rerun the consul- 
tation to test the solution and gain the expert's concurrence. 
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Represent* tioos: 

MYCIN’s rules are of the form: 

IF (conjunctive clauses) THEN (implication) 

Here is an example of a MY CIN rule for blood infections. 

RULE 83 
IF: 

1) The site of the culture is blood, and 
2} The gram stain of the organism is 
gramseg, and 

3) The morphology of the organism is 
rod, and 

4) The patient is a compromised host 
THEN: 

There is suggestive evidence (.6) that 
the identity of the organism is 
pseudomonas-aeruginosa 

TEIRESIAS allows the representation of MYCIN-lilce 
rules governing the use of other rules, L«., rule •based strat- 
egies. An example follows. 

METARULE 2 
IF: 

1) the patient is a compromised host, and 

2) there are rules which mention in their 
premise pseudomonas 

3) there are rules which mention in their 
premise UebsieHea 

THEN: 

There is suggestive evidence (.4) that the 
former should be dooe before the Latter. 


Sketefc of method 

MY CIN employs a generation- and- test procedure of a 
familiar sort. The generation of steps in the line-of-reasoning 
is accomplished by backward chaining of the rules. An IF- 
side clau se is either immediately true or false (as determined 
by patient or test data enured by the physician in the con- 
sultation); or is to be decided by subgoeling. Thus, “test 1 * 
is interleaved with “generation" and serves to prune out 
incorrect lines-of-reasoning. 

Each rule supplied by an expert has associated with it a 
“degree of certainty’* representing the expert's confidence 
in the validity of the rule (a number from 1 to 10). MY CIN 
uses a particular ad-hoc but simple model of inexact reason- 
ing to cumulate the degrees of certainty of the rules used in 
an inference chain (Shortliffe and Buchanan, 1975). 

It follows that there may be a number of “somewhat true" 
lincs-of-reasocung — some indicating one diagnosis, some in- 


dicating another. All (above a threshold) are used by the 
system as sources of knowledge indicating plausible lines- 
of-reasoning. 

TEIRESIAS' rule acquisition process is based on a record 
of MYCIN's search. Rule acquisition is guided by a set of 
rule models that dictate the form and indicate the likely 
content of new rules. Rule models are not given in advance, 
but are inferred from the knowledge base of existing rules. 


Language of Interaction 

The language used looks like it might be English but Is 
actually the dialect “Doctor-ese" used by members of the 
tribe of healing arts practitioners. Reasonably simple lan- 
guage processing methods suffice. When ambiguities cf in- 
terpretation are encountered, they are fed back to the user 
for decisions about meaning. 


Example diagnosis and treatment plan 

The following is an example of a diagnosis reported by 
MYCIN to one of its users following a consultation. An 
example consultation will not be shown, but one can be 
found in Shortliffe* s book (Shortliffe, 1976). 

My therapy recommendations will be designed 
to treat for organisms that are either very 
likely or, although less likely, would have 
a significant effect on therapy selection 
if they were present. It is important to 
cover for the following probable 
infections) and associated organism(s): 

INFECTION- 1 is CYSTITIS 

(Item 1) PSEUDOMONAS-CEPACIA (ORGANISM-3) 
(Item 2) CTTROBACTER-DIVERSUS [ORGANISM-*] 
(Item 3) E.COLi [ORGANISM-3] 

INFECTION-2 is UPPER-RESFIRATORY- 
INFECTION 

(Item 4) LISTERIA [ORGANISM-2] 

INFECTION-3 is BACTEREMIA 

(Item 3) ENTEROCOCCUS [ORGANISM-l] 

[REC-l] My preferred therapy recommendation 
is aa follows: 

In order to cover for Items (1 2 3 4 3): 

Give the following in combination: 
t) KAN A MY CIN 

Dose: 730 mg (7.3 mtfkg) ql2h IM (or IV) 
for 23 days 

Comments: Modify dose in renal failure 
2) PENICILLIN 

Dose: 2.500,000 units (23000 units/kg) 
q4h IV for 28 days 
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No clinical evaluation of MYCXN is currently being done, 
la a preliminary evaluation, dooe outside of the clinical 
setting, a panel of expert judges was asked to rate MYCIN’s 
performance along a number of dimensions (e.g., infecting 
organism identification, organism significance). In 90 per* 
cent of the cases submitted to the judges, a majority of the 
judges said that the pr ogra m’s decisions were the-same*as 
or as-good-u the decisions they would have made. 

Frptaaartnrt 

MY ON can expose its line of reasoning by offering ex- 
planations of vinous kinds. “Why” and “How” questions 
can be asked by the user during the consultation. “Why** 
is the user's way of inquiring, “Why do you want to know 
that?** “How” is his way of asking, “How did you conclude 
that. . . ?** In a post-consultation, he can similarly inquire 
about how the diagnostic conclusions were reached, and 
inquire about the reasons why certain other conclusions 
were ruled out. Examples of the MYCIN-TEIRESIAS ex- 
planation capabilities follow: 

MYCIN: Is the patient's illness with 
ORGANISM- 1 a hospital-acquired infection? 

USER: WHY 

TEIRESIAS: (i.e., WHY is it important to 
determine whether or not the infection with 
ORGANISM-1 was acquired while the patient 
was hospitalized?) 

[1.0] This will aid in determining the 
category of ORGANISM-!. 

It has already been established that 

[1.1] the gram stain of ORGANISM- 1 is 
gramneg, and 

[1.2] the morphology of ORGANISM- 1 is rod. 
and 

[1.3] the aerobicity of ORGANISM-1 is 
facultative 

Therefore, if 

[1.4] the infection with ORGANISM- 1 was not 
acquired while the patient was hospitalized 

then 

there is strongly suggestive evidence (.8) 
that the category of ORGANISM- 1 is 
enterobactehaceae 

Also: there is weakly suggestive evidence 
(.1) that the identity of 


ORGANISM- 1 is pseudomonas-aeruginosa 
[RULE 37] 

USER: HOW 1J 

TEIRESIAS: (i.e. HOW was it established 
that the aerobicity of ORGANISM- 1 is 
facultative?] 

The following were used: 

[3.1] RULE Z7 indicated there is strongly 
suggestive evidence (.8) that the 
aerobicity of ORGANISM- 1 is facultative 

[3.2] RULE 27 indicated there is weakly 
suggestive evidence (.2) that the 
aerobicity of ORGANISM- 1 is anaerobic 

Since this gave a cumulative CF of (.8) for 
facultative, and (.2) for anaerobic, it has 
been established that the aerobicity of 
ORGANISM- 1 is facultative. 

USER: WHY DIDN’T YOU CONSIDER 
STREPTOCOCCUS AS A POSSIBILITY FOR 
ORGANISM-1? 

TEIRESIAS: The following rule could have 
been used to determine that the identity of 
ORGANISM-1 was streptococcus: RULE 33 

But clause 2 (“the morphology of the 
organism is coccus”) was already known 
to be false for ORGANISM-1, so the rule was 
never tried. 


XMecsssfctt 

A few short comments aimed at knowledge engineering 
are in order. 

English is the language most natural and comfortable for 
our experts and users, and was therefore the language cho- 
sen for interactive consultation, explanation, and external 
representation of the rules (the internal format is 1NTER- 
USP). This situation is not peculiar to doctors; in most 
areas of application of intelligent agents I believe that Eng- 
lish (i.e., natural language) will be the language. of choice. 
Programming an English language processor and front-end 
to such systems is not a scary enterprise because: 

(a) the domain is specialized, so that possible interpreta- 
tions are constrained. 

(b) specialist-talk is replete with standard jargon and ster- 
eotyped ways of expressing knowledge and queries— just 
right for text templates, simple grammars and other simple 
processing schemes. 
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(c) the ambiguity of interpretation resulting from simple 
schemes can be dealt with easily by feeding back interpre- 
tations for confirmation. If this is done with a pleasant **I 
didn’t quite understand you. . tone, it is not irritating to 
the user. 

English may be exactly the wrong language for represen- 
tation and interaction in some domains. It would be awk- 
ward, to say the least, to represent DENDRAL’s chemical 
structures and knowledge of mass spectrometry in English, 
or to interact about these with a user. 

Simple explanation schemes have been a part of the AI 
scene for a number of yean and are not hard to implement. 
Really good models of what explanation is as a transaction 
between user and agent, with programs to implement these 
models, will be the subject (I predict) of much future re- 
search in AI. 

Without the explanation capability, 1 assert, user accept- 
ance of MYCIN would have been nil, and there would have 
been a greatly diminished effectiveness and contribution of 
our experts. 

MYCIN was the first of our programs that forced us to 
deal with what we had always • understood: that experts' 
knowledge is uncertain and that our inference engines had 
to be made to reason with this uncertainty. It is less impor- 
tant that the inexact reasoning scheme be formal, rigorous, 
and uniform than it is for the scheme to be natural to and 
easily understandable by the experts and users. 

All of these points can be summarized by saying that 
MYGN and its TE I RES IAS adjunct are experiments in the 
design of a see-through system, whose representations and 
processes are almost transparently dear to the domain spe- 
cialist. “Almost” here is equivalent to “with a few minutes 
of introductory description.” The various pieces of 
MYGN — the backward chaining, the English transactions, 
the explanations, etc. — are each simple in concept and re- 
alization. But there are great virtues to simplidty in system 
design; and viewed as a total intelligent agent system. 
MYCIN/TE IRES IAS is one of the best engineered. 

SUlX: signal understanding 

Historical note 

STJYX is a system design that was tested in an application 
whose details are classified. Because of this, the ensuing 
discussion will appear considerably less concrete and tan- 
gible than the preceding case studies. This system design 
was done by H. P. Nii and me, and was strongly influenced 
by the CMU Hearsay U system design (Lesser and Erman, 
1977). 


Task 

SU/X 1 s task is the formation and continual updating, over 
long periods of time, of hypotheses about the identity, lo 
cation, and velocity of objects in a physical space. The 
output desired is a display of the “ current best hypotheses” 


with frill explanation of the support for each. There are two 
types of input data: the primary signal (to be undentood); 
and auxiliary symbolic data (to supply context for the un- 
demanding). The primary signals are spectra, represented 
as descriptions of the spectral lines. The various spectra 
cover the physical space with some spatial overlap. 


Representations 

- The rules given by the expert about objects, their behav- 
ior, and the interpretation of signal data from them are all 
represented in the situation^ action form. The “situations” 
constitute invoking conditions and the “actions” are pro- 
cesses that modify the current hypotheses, post unresolved 
issues, recompute evaluations, etc. The expert's knowledge 
of how to do analysis in the task is also represented in rule 
form. These strategy rules replace the normal executive 
program. 

The situation-hypothesis is represented as a node-link 
graph, tree-tike in that it has distinct "levels,” each repre- 
senting a degree of abstraction (or aggregation) that is nat- 
ural to the expert in his understanding of the domain. A 
node represents an hypothesis; a link to that node represents 
support for that hypothesis (as in HEARSAY II, “support 
from above” or "support from below”), “Lower" levels 
are concerned with the specifics of the signal data. “Higher’' 
levels represent symbolic abstractions. 


Sketch of method 

The situation-hypothesis is formed incrementally. As the 
situation unfolds over dme, the triggering of rules modifies 
or discards existing hypotheses, adds new ones, or changes 
support values. The situation-hypothesis is a commoa work- 
space ("blackboard.” in HEARSA Y jargon) for ail the rules. 

In general, the incremental steps coward a more complete 
and refined situation-hypothesis can be viewed as a se- 
quence of local generate- and-test activities. Some of the 
rules are plausible move generators, generating either nodes 
or links. Other rules are evaluators, testing and modifying 
node descriptions. 

In typical operation, new data is submitted for processing 
(say, N doe-units of new dam). This initiates a flurry of 
rule- triggerings and consequently rule-actions (called 
“events”). Some events are direct consequences of data; 
ocher events arise in a cascade-tike fashion from the trig- 
gering of rules. Auxiliary symbolic data also cause events, 
usually affecting the higher levels of the hypothesis. As a 
consequence, support-froo-above for the lower level pro- 
cesses is made available; and expectations of possible lower 
level events can be formed. Eventually all the relevant rules 
have their say and the system becomes quiescent, thereby 
triggering the input of new data to reenergize the inference 
activity. 

The system uses the simplifying strategy of maintaining 
only ooe “best” situadoa-hypothesis at any moment, mod- 
ifying it incrementally as required by the changing data. This 
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approach it made feasible by several characteristics of the 
domain. First, there is the strong continuity over time of 
objects sad their behaviors (specifically, they do not change 
radically over tunc, or behave radically differently over 
short periods). Second, a single problem (identity, location 
gad velocity of a particular set of objects) persists over 
numerous data gathering periods. (Compare this to speech 
understanding in which each sentence is spoken just once, 
and each presents a new and different problem.) Finally, the 
system's hypothesis is typically "almost right," in pert be- 
cause it gets numerous opportunities to refine the solution 
(Le., the numerous data gathering periods), and in part be- 
cause the availability of many knowledge sources tends to 
©ver-determine the solution. As a result of all of these, the 
current best hypothesis changes only slowly with time, and 
hence keeping only the current best is a feasible approach. 

Of interest are the time-based events. These rule-like 
expressions, created by certain rules, trigger upon the pas- 
sage of specified amounts of tune. They implement various 
"wait-and-see" strategies of analysis that art useful in the 
domain. 


taka 

In the test application, using signal data generated by a 
simulation program because real data was not available, the 
program achieved expert levels of performance over a span 
of test problems. Some problems were difficult because 
there was very little primary signal to support inference. 
Others were difficult because too much signal induced a 
plethora of alternatives with much ambiguity. 

A modified SU/X design is currently being used as the 
basis for in application to the interpretation of x-ray crys- 
tallographic data, the GtYSAJLIS program mentioned later. 


The role of the auxiliary symbolic sources of data is of 
critical importance. They supply a symbolic model of the 
existing situation that is used to generate expectations of 
events to be observed in the data stream. This allows flow 
of inferences from higher levels of abstraction to lower. 
Such a process, so familiar to Al researchers, apparently is 
almost unrecognized among signal processing engineers. In 
the application task, the expectation-driven analysis is es- 
sential in controlling the combinatorial processing explosion 
at the lower levels, exactly the explosion that forces the 
traditional signal processing engineers to seek out the largest 
possible number-cruncher for their work. 

The design of appropriate explanations for the user takes 
an interesting twist in SU /X. The situation-hypothesis un- 
folds piecemeal over time, but the " appropriate" explana- 
tion for the user is one that focuses on individual objects 
over time. Thus the appropriate explanation must be syn- 
thesized from a history of all the events that led up to the 
current hypothesis. Contrast this with the MYCIN-TEI- 


RESLAS reporting of rule invocations in the construction of 
a reasoning chain. 

Since its knowledge base and its auxiliary symbolic data 
give it a model-of-the-ii tuition that strongly constrains in- 
terpretation of the primary data stream, SU/X is relatively 
unperturbed by trrorful or missing data. These data condi- 
tions merely cause fluctuations is the credibility of individ- 
ual hypotheses sad/or the creation of the "wait-and-see" 
events. SU/X can be (but has not yet been) used to control 
season. Since its rules specify whet types and values of 
evidence are necessary to establish support, and since it is 
constantly processing a complete hypothesis structure, it 
can request "critical readings" from the season. In general, 
this allows an efficient use of limited sensor bandwidth and 
data acquisition processing capability. 


Other case studies 

Space does not allow more than just a brief sketch of 
other interesting projects dux have been completed or are 
in progress. 


AM: mathematical di s c o very 

AM is a knowledge-based system that conjectures inter- 
esting concepts in elementary mathematics. It is a discoverer 
of interesting theorems to prove, not a theorem proving 
program. It was conceived and executed by D. Leoat for his 
Fh.D. thesis, and is reported by him in these proceedings. 

AM’s knowledge is basically of two types: rules that sug- 
gest possibly interesting new concepts from previously con- 
jectured concepts; and rules that evaluate the mathematical 
"interestingsess" of a conjecture. These rules attempt to 
capture the expertise of the professional mathematician at 
the task of mathematical discovery. Though Least is not t 
professional mathematician, he was able successfully to 
serve as his own expert in the building of this program. 

AM conducts a heuristic search through the space of con- 
cepts creatable from its rules. Its basic framework is gen- 
eratioo-and-test. The generation is plausible move genera- 
tion, as indicated by the rules for formation of new concepts. 
The test is the evaluation of "mterestingness." Of particular 
note is the method of test-by -example that lends the flavor 
of scientific hypothesis testing to the enterprise of mathe- 
matical discovery. 

Initialized with coocepts of elementary set theory, it con- 
jectured concepts in elementary number theory, such as 
"add," "multiply" (by four distinct paths!), "primes," the 
unique factorization theorem, and a concept similar to 
primes but previously not much studied called "maximally 
divisible numbers." 


MOLGEN: planning experiments In molecular genetics 

MOLGEN, a collaboration with the Stanford Genetics 
Department, is work in progress. MOLGEN* s task is to 
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provide intelligent advice to a molecular geneticist oa the 
planning of experiments involving the manipulation of DNA. 
The geneticist has various kinds of laboratory techniques 
available for changing DNA material (cuts, joins, insertions, 
deletions, and so oq); techniques for determining the bio- 
logical consequences of the changes; various instruments 
for measuring effects: various chemical methods for indue* 
ing, facilitating, or inhibiting changes; and many other tools. 

Some MOLGEN programs under development will offer 
planning assistance in organizing and sequencing such tools 
to accomplish an experimental goal. Other MOLGEN pro- 
grams will check user-provided experiment plans for feasi- 
bility; and its knowledge base will be a repository for the 
rapidly expanding knowledge of this specialty, available by 
interrogation. 

In MOLGEN the problem of integration of many diverse 
sources of knowledge is central since the essence of the 
experiment planning process is the successful merging of 
biological, genetic, chemical, topological, and instrument 
knowledge. In MOLGEN the problem of representing pro- 
cesses is also brought into focus since the expert's knowl- 
edge of experimental strategies — proto-plans— must also be 
represented and put to use. 

One MOLGEN program (Stefik, 1978) solves a type of 
analysis problem that is often difficult for laboratory scien- 
tists to solve. DNA structures can be fragmented by chem- 
icals called restriction enzymes. These enzymes cut DNA 
at specific recognition sites. The fragmentation may be com- 
plete or penial. One or more enzymes may be used. The 
fragmented segments of the DNA are collected and sorted 
out by segment length using a technique called gel electro- 
phoresis. The analytical problem is similar to that faced by 
DENDRAL: given an observed fragmentation pattern, hy- 
pothesize the best structural explanation of the data. More 
precisely the problem is to map the enzyme recognition sites 
of a DNA structure from complete or partial " digests”. 

The program uses the model-driven approach that is sim- 
ilar to DENDRAL's and is discussed earlier. The method is 
gsnerate-aad-test. A generator is initiated (hat is capable of 
generating all the site-segment maps in an exhaustive, irre- 
dundant fashion. Various pruning rules are used to remove 
whole cl as ses of conceivable candidates in light of the data. 
Some of the pruning rules are empirical and judgmental. 
Others are formal and mathematically based. 

The program solves simpler problems of this type of anal- 
ysis better than laboratory scientists. The harder problems, 
however, yield only to the broader biological knowledge 
known by the scientists and not yet available to the pro- 
gram's reasoning proc ess es. In a recent test case, a problem 
whose solution space contained approximately 150,000,000 
site-fragment "maps" was solved in 27 seconds of PDP-10 
time using the INTERLISP programming system. 

Interestingly, the computer scientist's formal understand- 
ing of the nature of the problem, his formal representation 
of the knowledge used for pruning out inappropriate candi- 
dates, and the computational power available to him enabled 
him to suggest a few new experiment designs to his geneticist 
collaborators that were not previously in their repertoire* 


CRYSALIS: Inferring protein structure from electron 
density me pc 


CRYSALIS, too, is work in progress. Its task is to hypoth- 
esize the structure of a protein from a map of electron 
density that is derived from x-ray crystallographic data. The 
map is three-dimensional, and the contour information is 
crude and highly ambiguous. Interpretation is guided and 
supported by auxiliary information, of which the amino acid 
sequence of the protein's backbone is the most important. 
Density map interpretation is a protein chemist's art. As 
always, cap airing this art in heuristic rules and putting it to 
use with an inference engine is the project's goal. 

The inference engine for CRYSALIS is a modification of 
the SXJfX system design described above. The hypothesis 
formation process must deal with many levels of possibly 
useful aggregation and abstraction. For example, the map 
itself can be viewed as consisting of "peaks," or "peaks 
and valleys," or "skeleton." The protein model has 
"atoms," "amide planes," "amino add sidechains," and 
even massive substructures such as "helices." Protein mol- 
ecules are so complex that a systematic generation- and- lest 
strategy like DENDRAL’s is not feasible. Incremental piec- 
ing together of the hypothesis using region-growing methods 
is necessary. 

The CRYSALIS design (alias SU/F) is described in a 
recent paper by Nii and Ftigenbaum (1977). 


SUMMARY OF CASE STUDIES 

Some of the themes presented earlier need no recapitu- 
lation. but I wish to revisit three here: generation- tad- lest; 
tituatioo=> action rules; and explanations. 

Generation and test 

Aircraft come in a wide variety of sizes, shapes, and 
functional designs and they are applied in very many ways. 
But almost all that fry do so because of the unifying physical 
principle of lift by airflow; the others are described by ex- 
ception. If there is such a unifying principle for intelligent 
programs and human intelligence it is generation- and- test. 
Nc wonder that this has been so thoroughly studied in Al 
research! 

In the case studies, generation is manifested in a variety 
of forms and processing schemes. There are legal move 
generators defined formally by a generating algorithm 
(DENDRAL's graph generating algorithm): or by a logical 
rule of inference (MYClN's backward chaining). When legal 
move generation is not possible or not efficient, there are 
plausible move generators (as in SVJX and AM). Sometimes 
generation is interleaved with testing (as in MYC1N, SU/X, 
and AM). In one case, all generation precedes testing (DEN- 
DRAL). One case (META- DEN DRAL) is mixed, with some 
testing taking place during generation, some after. 
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Test tbo show* g put vsriety. There in simple uses 
(MYQN: “U the organism eerobkr ; SU/X: “Has a sph- 
eral Unc appeared at position P?**) Some tests an complex 
heuristic evaluations (AM: "Is the new concept 'interest- 
ing’?*’; MOLGEN: "Will the reaction actually take place?**) 
Sometimes a complex test can involve feedback to modify 
the object being tested (as in META-DENDRAL). 

The evidence from our case studies supports the assertion 
by Newell and Simon that ftacrarion-aod-tast is a law 0 / 
our science (Newell and Simon, 1976). 


Slatarion^action ntUs 

Situarion^Acrioo rules an used to represent experts* 
knowledge in all of the case studies. Always the situation 
part indicitei the specific conditions under which the role 
is relevant. The acaoo pert can be simple (MYQN; con- 
clude presence of particular organism; DENDRAL: con- 
clude break of particular boo d). Or H can be quite complex 
(MOLGEN: an experiential procedure). The overriding con- 
sideration in making design chokes is that the rule form 
chosen be able to represent dearly and directly what the 
expert wishes to express about the domain. As illustrated, 
this may necessitate a wide variation in rule syntax and 
semantics. 

From a study of all the projects, a regularity emerges. A 
salient feature of the Si tustjon=b Action rule technique for 
representing experts’ knowledge is the modularity of the 
knowledge base, with the concomitant flexibility to add or 
change the knowledge easily as the experts* understanding 
of the domain changes. Here too one must be pragmatic, 
not doctrinaire. A technique such as this cannot represent 
modularity of knowledge if that modularity does not exist in 
the domain. The virtue of this technique is that it serves as 
a framework for discovering what modularity exists in the 
domain. Discovery may feed back to cause reformulation of 
the knowledge toward greater modularity. 

Finally, our case studies have shown that strategy knowl- 
edge can be captured in rule form. In TEIRESIAS, the 
metarules capture knowledge of bow to deploy domain 
knowledge; in SU/X the strategy rules represent the ex- 
perts* knowledge of "bow to analyze’* in the domain. 


Explanation 

Most of the programs, and all of the more recent ones, 
make available an explanation capability for the user, be he 
eod-uscr or system developer. Our focus on end-users in 
applications domains has forced attention to human engi- 
neering issues, in particular making the need for the expla- 
nation capability imperative. 

The Intelligent Agent viewpoint seems to us to demand 
that the agent be able to explain its activity; else the question 
arises of who is in control of the agent's activity. The issue 
is not academic or philosophical. It is an engineering issue 
that has arisen in medical and military applications of intel- 


ligent agents, sad will govern fcture a rrtpranre of AI work 
In applications areas. And on the pfafloeophkal level one 
might even argue that there is a moral imperative to provide 
accurate explanations to end-usera whose i n t uitions about 
our systems are almost nil. 

Fussily, the explanation capability is seeded as part of the 
coocertcd attack on the knowledge acquisition problem. Ex- 
planation of the reasoning process is central to the interac- 
tive transfer of expertise to the knowledge base, and it is 
our most powerful tool for the debugging of the knowledge 
base. 
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REMARKS ON THE RELATIONSHIP BETWEEN 
ARTIFICIAL INTELLIGENCE AND COGNITIVE PSYCHOLOGY 

Allen Newell 

Carnegle-Mellon University 
Pittsburgh, Pennsylvania 

1 . INTRODUCTION 

Shortly after I agreed to participate in this conference, I received a letter 
from a psychologist friend, who had been Working in the area of cognitive simulation 
He had become discouraged, feeling that less and less work was going on. He felt 
that attempts to simulate cognitive functioning were a dead end and he was leaving 
the field. He wanted to let me know. 

Now, my own impression is that matters stand rather well in the use of infor- 
mation processing models in psychology. The dissonance between this letter and 
my own view led to considerable reflection over the next several months. This 
seems an appropriate occasion to pass on these reflections. Thus, I wish to address 
myself to the relationship between artificial intelligence and cognitive psychology. 
I will not provide here any survey of the research being done. Nor will I be 
reporting any new research (though in fact some of the odd pieces I will mention 
are fairly recent). 

Furthermore, these are reflections on the relationship. I shall not attempt 
any systematic argument. For that would be, In effect, to argue the necessity of 
my own world view — my own Weltanshauunq. And I agree with the substance of 
Churchman's paper in this conference, that one cannot argue such things directly. 

Let me set the stage by two preliminaries, before moving to the points 
themselves. 


2. THE POSSIBLE RELATIONSHIPS 

We list in Figure 1 a number of possibilities that cover the range of relation- 
ships, that might exist between artificial intelligence and psychology. The list 
moves roughly from weak to strong relationship as one moves from top to bottom. 
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Thus, right at the top, there may be no relationship at all between artificial 
Intelligence and psychology. This Is certainly a possible view: 

fclnce the theory rests' on analogies between the human and the 
mechanical process, Newell et al take’ some pains to produce 
comparisons between human problem solving and the behavior of 
the machine. In this effort [LT] they draw upon previously 
published descriptions of relevant human behavior. They add 
nothing to our further understanding of the living mechanisms, 
but they do provide a better understanding of the computer. 

(T. Kendler, 1961, pp. 451-452.) 

The next stage Is where one feels that artificial Intelligence provides 
metaphors, thus making psychologists attend to new phenomena In appropriate ways. 
This view is the Interpretation many scientists put on cybernetics through the 
forties and fifties. And many people hold It about artificial intelligence now: 
Psychology and the study of artificial Intelligence are both 
concerned with Intelligent behavior, but otherwise they are 
not necessarily related except to the extent that metaphors 
borrowed from one discipline may be stimulating to the other. 

(A.G. Oettlnger, 1969, p. 30.) 

No relationship 

Metaphor / Attention focussing 
Forces operatlonallty 
Provides language 
Provides base (Ideal) models 
Sufficiency analysis 
Theoretical psychology 
Self sufficient 

Figure 1: Possible Relationships between AI and Psychology 
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The next step of engagement Is that emphasis on programs and mechanisms forces 
the psychologist to become operational, that Is, to avoid the fuzziness of using 
mentallstlc terms. It is a sort of mental hygiene. Behaviorism Is In part a 
similar sort of mental hygiene, but one that achieves Its effect by remaining In 
the observation language of the experiment (l.e., the behaviors that can be 
observed). Artificial Intelligence offers an operatlonalism with respect to theory. 
This view has been very popular, as the following quotations testify: 

The advantage of playing this kind of game lies solely In the 
fact that, if you talk about machines, you are more certain to 
leave out the subjective, anthropomorphic hocus-pocus of 
mental ism. ... 

There Is still a further step possible along this same road: 
the design and construction of actual robots who perform 
different human functions as well or better than a man can 
do. ... The only use that lies In designing an actual robot 
Is to make sure that. In stating the properties of a function, 
we have not left In unwittingly some mystic ambiguous mentallstlc 
term. (E. Boring, 1946, p. 191.) 

... On the other hand, the computer program allows us to 
specify with complete precision, complex models that certainly 
embody what we are vaguely point to with these words. We can 
then, as with the concepts "active memory" and "learning" briefly 
discussed here, study our models to get a better Idea of what we 
have been talking about. 

The computer Is just a powerful tool for clearly specifying 
rules that mechanisms must follow In carrying out procedures 
that process information. (L. Uhr, 1969, p. 297.) 
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The next stage sees the language as the major connection: The language of 
programs and data structures (e.g. , list structures) Is the appropriate vehicle for 
describing the behavior of humans, In contradistinction, say, to classical mathe- 
matics. An analogous view was strongly held a decade ago In arguing that for the 
social sciences the appropriate mathematics was that of finite structures (matrix 
analysis, markov processes, graph theory), as opposed to the mathematics of the 
continuum (i.e., differential equations). Perhaps, the clearest statements of the 
language view with respect to artificial Intelligence have been made by George 
Miller: 

The computer program can play a double role In psychology: as 
a model of an Intelligent system and, even more broadly, as a 
kind of language In which theories can be expressed. Everyone 
recognizes the Importance of holding a good theory; the advantages 
of speaking a good language, however, are not so often recognized. 

(p.‘ 9) * 

There is much that the psychologist can learn from a study of 
computing machines and the structure of their programs. Progranm- 
Ing languages seem to offer an excellent medium for the expression 
of psychological theories, even though using such languages Implies 
that men and machines are In some deep sense considered to be equi- 
valent — functionally. If not structurally. (G. Miller, 1962, 
p. 21.) 

The stages of metaphor, operational Ity and language are somehow content free. 
That Is, the gains to psychology are In various behaviors and disciplines of the 
psychologist. The next stage finally accords the product of the artificial Intelli- 
gence models significance, even if not their content. Here artificial Intelligence 
Is used to provide base lines against which to view actual behavior. These base 
lines are In the direction of optimum behavior, rather than In the direction of 
random behavior as In the base lines usually provided for by statistics). Such 
Ideal types are used fruitfully in several places In science. In psychology a good 
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example is the work of Ward Edwards on behavior in uncertain situations, where 
humans are consistently conservative compared to the optimal solution, as computed 
from Bayes theorem. Without this comparison with an ideal system, a significant 
aspect of the data would be missed. In artificial intelligence this view is 
perhaps less conmon than might be suspected, given that computers are programed 
to do the best job possible. Nevertheless, one finds the Attitude expressed 
occasionally: 

The computer analogues used in some of the model of human 
information processing and thought depict ideal Intellectual 
slaves, experiencing practically no time lag, no loss of memory, 
and no reluctance to consider all of the available evidence. 

The human to whom our formulations are meant to apply do 
unfortunately experience considerable limitations In these 
regards. (W.J. McGuire, 1968, p. 159). 

The next turn of the screw reflects a unique feature of human cognitive 
behaviors, namely that they constitute performances for which often we do not know 
• any way that they can be accompllsed. Thus, it becomes of interest to discover 
systems that perform these tasks. If, in addition, no mechanisms are used in 
these systems that clearly go beyond the capacities of the human, then an Initial 
theory has been provided. This level has been called sufficiency analysis , since It 
seeks to show that a sufficient set of mechanisms exists for a particular Intellectual 
task. To Illustrate, if one develops a chess program that examines 800,000 positions 
In deciding on a move, then one has not made a contribution; since excellent evidence 
exists that no human could consider 800,000 separate Items of information in ten 
minutes. But If the chess program only considers around 100 positions, and if 
there are no other ways in which the program radically violates the general 
character of human processing capacities, then It may be taken as a first model. 

An example of this view Is the following: 

The definitions are both nominal and ostensive in the sense that 
when we speak, for example, of "pathogenic conflict" we can 
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point to a precise procedure In the program which computes 
whether two beliefs are In conflict or not. We must postpone 
the question, which eventually must be faced, of how closely 
this corresponds to the nature of pathogenic conflict In real 
persons. But at this point we can say there Is a rough match 
between the output of the program and typical behavior of 
patients In psychotherapeutic sessions. (K.M. Colby and 
J.P. Gilbert, 1964, p. 417) 

This view has a certain value in Itself, since psychology has general Ignored 
the question of explaining how It is that humans can perform the acts of intelli- 
gence they routinely accomplish. Thus, it adds a new mode of analysis. 

With the next turn, we get artificial intelligence as theoretical psychology. 
This Is analogous to the view of the mathematics of differential equations as 
theoretical physics. Thus the actual theories of cognitive psychology are to be 
expressed as artificial intelligence systems. We would expect to find artificial 
Intelligence systems of direct empirical relevance, and also artificial intelligence 
systems being developed for their own sake, just as in mathematics there is 
concern with the differential equations of physical Interest (e.g. , the Mathleu 
equation) and also the pure theory of differential equations. This view has been 
often expressed; for instance: 

Quite typically, these models express psychological propositions 
in terms of individual operations for matching, generating, 
transforming, and retrieving information. These operations 
are knit together to form systems of complexly organized 
structures and processes. Since the structures and processes 
are represented explicitly, such models enable us to go behond 
measures of the quantifiable and statistical properties of 
behavior to investigations of the specific sequences of stimuli 
and responses Involved. ... By comparing model -generated 

W 

behavior with data from humans, we can decide unambiguously 
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whether the model is sufficient to account for the phenomena • 
we are investigating. Concerned as they are with the micro- 
structure of behavior, information-processing psychglogists 
often prefer to work with extensive sequential data from indi- 
vidual subjects. (W. Reitman, 1969, p. 246. } 

There is yet one more twist -- a radical one, but not totally implausible. 

One can view artificial intelligence as sufficient within itself for the entire 
task of understanding the nature of human intelligence. Thus, the behavioral 
data now being gathered and analyzed In psychological laboratories are taken to be 
irrelevant. With our long standing involvement In an empiricist view of science, 
this may seem like nonsense. But consider that the constraints on intelligent 
behavior in our world may be such that there exists In essence, only one type of 
system that can accomplish it. Then we might be able to discover that system by 
direct analysis, knowing only the nature of the world (the organism's task environ- 
ment) and the general kinds of performances of which it is capable. The plausibility 
of this can be enhanced considerably if two conditions are added. First, the basic 
system Itself must have arisen by evolution. Second, the system must be able to 
develop from a basic system (capabilities unknown, but fundamentally simple) to 
one with full intelligence. There are few who subscribe to this viewpoint totally. 
However a hint can be found in the following quotation: 

Nor Is It true that psychologists take the experimental 
evidence Into account but that others [engineers working 
on pattern recognition] do not, for it is not clear that 
much really firm evidence has been collected, except for 
a few scattered findings, chiefly from neurophysiology. 

As horrifying as it may sound to some, the chief sources 
of specification of a model for pattern recognition are 
Intuition and introspection, and in this we all draw upon 
our own resources as human beings. Since these are two 
functions that have made twentieth century psychology 
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especially uneasy, there Is no reason to think that 
psychologists are terribly adept at them. (t. Uhr, 

1966, p. 291.) 

I have laid out this array of viewpoints to locate myself and the nature of 
ny comments. I wish to focus on the strong end — namely, on artificial intelligence 
as theoretical psychology. (I do not, however, go to the last stage.) Thus, I am 
much concerned with the use of artificial intelligence systems as theories for 
detailed and explicit bodies of data on human cognitive behavior. 

The literature that talks about simulation of cognitive processes speaks 
mostly from views down toward the weak end, as I have tried to indicate with the 
quotations. While I think that artificial Intelligence can be relevant to 
psychology In all of these ways, I have always felt that quoting them smacked a 
bit of damning with faint praise. If it Is not possible to do the real job — 1.e_, 
to be theory In the full sense — then one must settle for the advantages that do 
exist. + (To be fair to those who have espoused these various advantages — includ- 
ing myself — clarity about the role of a new development is achieved only slowly.) 

3. WHAT IS ARTIFICIAL INTELLIGENCE? 

The second preliminary is to fix what I mean by artificial intelligence for the 
purpose of this paper. As shown In Figure 2 there is a very large encompassing 
domain labeled variously cybernetic systems , information processing systems, control 
systems, etc. — this entire familiar Interrelated scientific and technological 
domain that has arisen since World War II. One major subdomain is that of symbolic 
systems , which Is pretty much coterminous with the systems of interest to computer 
science. Symbolic systems are to be distinguished from discrete systems, as the 
control theorist uses that term. In having symbols that have referential structure. 
Programnl ng and linguistic systems would be another set of names for the same-area. 

+ Psychology itself has a nice example. One often hears that a good theory Is one 
^.that leads to good new experiments. While true, this virtue often has to serve 
in the absence of more substantial advantages, such as predictive and explanatory 
power. 
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FIGURE 2: Cybernetic Systems and its Subdomains. 
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Within symbolic systems there is a subdomain called heuristic programming , e.g. , 
programs for problem solving, theorem proving, game playing, induction, etc. This 
is part of artificial intelligence, as the term is coimtonly used. There are also 
other parts of artificial intelligence, such as pattern recognition . Some pattern 
recognition systems are symbolic, e.g., the work of Uhr (1961). But other 
pattern recognition systems are discrete, though not symbolic (e.g., neural nets), 
and some are not even discrete (e.g., holographic systems). 

With Figure 2 as background, then, when I refer to artificial intelligence I 
will mean heuristic programming — that is, symbolic systems for performing 
intellectual functions. I will exclude such areas as pattern recognition — not 
because they are any less important, but because they are a different story for a 
different time. 

More important, I wish to broaden my concern from artificial intelligence to 
the whole of symbolic systems. For the right question to ask is not about the 
relation of psychology to artificial intelligence systems, but about the relation of 
psychology to symbolic systems. In fact, this larger view already has a name -- it 
is called information processing psychology. It is to be distinguished from the 
flurry within psychology some years ago on the use of information theory, as 
developed by Shannon (e.g., see Attneave, 1959). Information processing psychology 
is concerned essentially with whether a successful theory of human behavior can be 
found within the domain of symbolic systems. 

The reason for the expansion is clear if you view the matter from psychology's 
vantage point, which wants to construct theories to describe and explain human 
behavior. Symbolic systems provide a possible class of systems within which such 
theories might be formed. Some of the behaviors of interest are primarily problem 
solving — e.g., a man playing a game of chess. But much behavior of interest is 
not intellectually demanding — e.g., learning new information, interpreting a 
conmand in natural language, retrieving a relevant fact. But these tasks are also 
susceptible to an analysis in terms of symbolic systems and information processing. 
Thus, artificial intelligence covers only a part of the relevant systems. 
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I am Insisting on the importance of the general type of system used to form 
specific theories of human behavior -- In our case, symbolic systems. It Is, then, 
worthwhile to note that psychology has searched for its theories mostly in terms of 
classes of systems other than symbolic systems. Behaviorism is in general coupled 
with a view of systems of stimulus and response associations. Gestalt psychology 
Is coupled with a view of continuous fields which reorganize themselves. Psycho- 
analytic theory is framed in terms of energy constructs, with conservation laws 
a major organizing feature. All of these views — and the three of them account 
for a large fraction of psychological theory — are quite distinct from symbolic 
systems. 

This emphasis on the substantive content of information processing models Is 
In sharp contradistinction to the neutrality of 'computer simulation per se. This 
latter has been emphasized by many people. It can be seen In the earlier quote of 
Uhr In connection on operational ity. Here Is another: 

I should like to conclude with this final comment: My 
Insistence that a theoretical formulation be rendered in 
such a manner that It could be converted into a computer 
program does not in itself predispose us toward any par- 
ticular type of theory. ... The model resides wholly in. 
the program supplied to the computer and not at all in the 
hardware of the computer itself. For this reason any 
model can be programmed — provided only that It Is 
sufficiently explicit. (Shepard, 1963, p. 67.) 
fly own Insistence does not conflict with the above statement. Rather, It reflects 
an additional product of the growth of computer science, namely, that of a theoretical 
model of symbolic behavior. After the fact, one can see that such a theory might 
have emerged within psychology (or linguistics) without the advent of the computer. 

In historical fact, the theory emerged by trying to program the computer to do 
non-numerl cal tasks and by trying to construct abstract theories of computation 
and logic. 

With this background, let me now make a series of points. 
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4. POINT ONE: PENETRATION INTO EXPERIMENTAL PSYCHOLOGY 

The first point Is that the penetration of Information processing theories 
Into experimental psychology Is very substantial. To see this, one must take the 
broader view I have just emphasized. Information processing, not artificial 
Intelligence, Is the critical Issue, simply because most tasks Investigated In 
psychology are not problem solving or complex learning. 

Furthermore, the total range of work that now operates within an Information 
processing framework by no means derives from a single source. More precisely, 
the wider domain, which we labeled cybernetic systems in Figure 2, has been the 
comnon source of all the work (especially if we understand It to include develop- 
ments in operational mathematics, such as decision theory and game theory). 

But this broad development has permitted many parallel developments In psychology, 
all converging on the class of information processing systems. Let me briefly 
identify these main lines of development. 

Perhaps the most important one in terms of number of investigators is that 
concerned with the study of immediate memory. In terms familiar to this audience, 
the basic problem is to discover the logical design of the short term memory. 
Actually, there appear to be several such memories, some of tte order of hundreds 
of milliseconds half life, at least one of the order of several seconds. Since no 
anatomical or physiological data exist on these memories, their existence and 
characteristics must be inferred entirely from behavior. Thus, there is even 
controversy over what memories exist (Melton, 1962). 

Now the concern with the logical design of a system does not necessarily imply 
concern with a symbolic system. And, indeed, the genesis of this work goes back to 
communications engineering and information theory. The book by Broadbent on 
Perception and Corrriuni cation (1958), which was one of the milestones in this area, 
shows this very well: signal processing, but not symbol processing. 

What changed this was the discovery that the human Immediate memory appears 
to hold symbols — chunks , to use the term introduced by George Miller in his well- 
known paper on the magic number seven (Miller, 1956). This established that one 
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should consider the human as an information processing system with a short term 
memory of constant capacity, measured in number of symbols. By now, this view 
permeates all work, as can be seen in the numerous models of short term memory 
that are now available (many of them summarized In Norman (1969)). 

A second development is in psycholinguistics, where the work of Chomsky has 
had a very large impact. First, observe that Chomsklan linguistics implies a 
symbolic system. One can emphasize, as have the linguistics, that performance 
should be distinguished from competence, so that a model of the linguistic ability 
(t.e., the set of syntactical rules) does not imply that language is In fact 
processed in a person by a machine that takes the rule system as input. However, 
if one wants to draw any inspiration from linguistics for psychology, then it will 
still be a system of this kind i.e. , some kind of a system that deals with 
discrete symbols with rules and transformations on those symbols. 

This Is exactly what has happened in psycholinguistics, where many studies are 
being performed, taking seriously the notions of linguistic transformation and the 
encoding of meaning (semantics) in the so-called deep structure (Chomsky, 1967). 

The attempt to characterize the development of children's grammars, which thereby 
attributes to them a (simple) system of rule following behavior on symbol structures 
(language utterances), is part of the same picture (Smith and Hiller, 1966). 

Problem solving . A third development Is the simulation of cognitive processes 
in problem solving by means of computer programs. This is the development associated 
with (Intimately entwined with, would be a better phrase) artificial intelligence. 

The problem solver is viewed as a symbolic system, capable of following strategies 
of search, applying heuristics, calculating results, both symbolic and (on occasion) 
numeric, and evaluating partial results. The efforts referred to here are those 
one would also consider psychology (tn line with the choices with respect to 
Figure 1), namely, those where direct comparison Is made between the symbolic 
system and data from human behavior. Good representatives of this work can be 
found In the well-known collection by Feigenbaum and Feldman (1963) (see also 
Reitman, 1965). 
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Concept formation . A fourth area of development Is in the study of concept 
formation. Work in this area, of course, goes back many years (e.g., to Hull, 

1920). A major turning point Is symbolized by the book by Bruner, Goodnow and 
Austin (1956), which made extensive use of the notions of strategy and hypothesis 
formation, as well as cognitive strain (being essentially the dynamic memory load 
needed to car »7 out various strategies). The system implied there was very much 
a symbolic system, though Its inspiration came out of decision and game theory, 
rather than computer science. 

However, though there has been substantial work in artificial intelligence on 
concept formation (inspired in large part by the Bruner, Goodnow and Austin 
analysis) and even on Information processing models for its psychology (e.g*, Hunt, 
1962; Hunt, Marin and Stone, 1966), most of the upsurge of work that followed in 
the late fifties and early sixties could not reasonably be seen as working within 
an information processing framework. It would be better characterized as a 
straightforward experimental investigation of psychological phenomena, in which 
various limited questions were posed and investigated without any deep commitments 
to the type of processing system implied. For example, studies were done to show 
that there was a systematic effect of the number of relevant versus irrelevant 
dimensions in the stimulus configurations; and to show the effect of the availa- 
bility of past information (Bourne, 1966). 

However, gradually more explicit assumptions have been made about the nature 
of the subject's processing — first in terms of hypothesis testing (Restle, 1962), 
more recently in terms of general rule-following behavior (Haggard and Bourne, 1965). 
These shifts imply a symbolic processing system. 

Summary . My purpose in quickly going over these lines of development is not 
to establish them in any detail — for this I have hardly done. It is to call your 
attention to the use of symbolic models in many places throughout experimental 
psychology. It suggests (and I maintain) that a shift in the Zeitgeist in 
•psychology has taken place toward a view of man as an information processor. 

In fact, I have left out several additional lines of development, for example 
the work in verbal learning. Although the non-psychologist can be pardoned for 
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thinking that this Is coextensive with psycholinguistics. In fact It Is a separate 
experimental tradition going back to Ebbinghaus and his use of nonsense syllable 
learning (1885). Work on the learning of verbal materials — serial lists, paired 
associates, and free recall —have been one of the bastions of S-R psychology, 
since the phenomena lend themselves well to explanation In terms of the formation 
of associations. 

Let me quote a paragraph from a recent study by a psychologist who. has long 
worked in this area. The study is entitled, "Image as a mediator in one-trial 
paired-associated learning." It seeks to Investigate the use of mneumonic devices 
in memorization. It has long been known that If you want to memorize a list of, 
say, ten Items, then a good we.y. to proceed is by having an already learned list 
of associations, say, 1-bun, 2-shoe, 3-tree, 4-door, ... 10-hen, and then (to 
memorize the new material) forming a bizarre visual scene Involving each of the 
Items and the word In the permanent list.. That is, if the first item to be 
memorized was whale , then visualize the whale with a bun In Its mouth; If the 
second was a bicycle , then visualize the bike riding down the toe of the shoe, and 
so on. It will then be found (so goes the lore) that the k^ item can be reliably 
recalled by going from the' number, say 4, to Its word, say door, and then to the 
visual scene, from which the object can be recalled. (The "1-bun,..." list Is 
memorized once and can be used for a lifetime.) 

The present study is a preliminary effort to make some 
experimental contact with the hypothetical construct of 
visual Image with no immediate Intent to assert the reality 
of such a phenomenon. In the present study S's were 
Instructed to form visual Images and to use them in memoriz- 
ing lists of words. Whether or not they did so may remain 
in question. The fact that they accepted the Instructions 
and maintained that they followed them cannot be denied. In 
this report the term "image" will be used to refer to the 
processes S's said they followed when Instructed to "picture" 
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the 10 articles mentioned in connection with the previously 
learned list of 10 words that rhyme with the first 10 numbers. 

(B.R. Bugelski, E. Kidd and J. Segmen, 1968, p. 70.) 

This quotation accurately reflects the present state of verbal learning research. 

It is still much enmeshed in a behavioristic stance, which views with alarm 
attempts to deal with internal processes (symbolic or otherwise). But they are 
being driven to such attempts, in large part because of a shift in view to an 
organism as an active, symbol manipulating system. In the decades prior to the 
current one, such notions as imagemediated paired associate learning simply did 
not call for investigation. The current attempts testify to the shift In the 
Zeitgeist. 

A final comment: if one looks at where the excitement has been over the last 

ten years in psychology -- the places where rapid growth is taking place and which 
people talk about when asked "what's new" — a substantial fraction of these turn 
out to be connected to this shift towards information processing models. The 
work on immediate memory is one; the rise of a linguistically oriented psycho- 
linguistics is another; the study of children's grammar (within psycholinguistics) 
is a third. (Possibly the work on problem solving is yet another, but that Is 
more difficult for me to assess, since I am so involved in it.) 

5. POINT TWO: FROM IMMEDIATE MEMORY TO IMMEDIATE PROCESSOR 

In the discussion of the possible relationships of information processing 
models to psychology we opted for the use of such models as detailed theories of 
behavior, rather than, say, metaphors or exercises in the discipline of operational- 
ism. Even taking for granted the extent of the activity discussed above, there Is 
still the question of its nature. Does the work on immediate memory use the notions 
of information processing only as a metaphor, rather than theory? After all. In a 
primarily experimental discipline (such as psychology still remains), one can play 
fast and loose with many verbal formulations and many metaphors, so long as they 
lead to asking interesting experimental questions. 
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Let me pursue this question with respect to the work on immediate memory. To 
understand this area you must know some background. The behaviorist era In 
psychology, which reigned in its various forms for the thirty years prior to 
World War II, moved the question of learning to be the central question of an 
objective psychology. The study of sensation and perception gradually came to 
take subordinate places. Even more so, the study of memory became simply an 
aspect of learning. When work on immediate memory was restimulated in the fifties 
and sixties, it was largely as a re-emphasis within the notion of learning. Thus, 
these studies could be conducted with only the issues of memory in mind — the nature 
of acquisition, retrieval, capacity, reliability, etc. 

If I were to suggest to this audience that they study the structure of an 
unknown Information processing system, then certainly the kinds of memories would 
be of prime importance, i.e., their capacities and access characteristics. But 
the nature of the rest of the central processor would be of equal interest, I.e., 
the control structure and the basic processing operations. Almost none of this 
concern with processing, as opposed to memory, is evident in the earlier psycholog- 
ical literature on immediate memory. But recently — within the sixties — there 
has been a shift toward such concern. And this shift carries with it the use of 
Information processing theories in detail. 

Some brief examples are appropriate to show this situation. I will not attempt 
any historical comparison, but rather give examples of current work that uses 
information processing assumptions, not as metaphor but as a theory. 

If we ask a subject "What is the 7th letter after G in the alphabet?" 

(Answer: N), it will take him about a second and a half to respond. If we vary 
this question by changing the starting letter and the number, then we get a curve, 
such as that shown in Figure 3 for subject RS. If we kept at our subject long 
enough, we might expect him to memorize all the answers (there are only 26x25 = 650 
distinct questions), in which case the time to respond might be independent of the 
details of the question. But barring that, the subject must somehow generate the 
answer. The figure immediately suggests that he does this by counting down the 
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Number 


FIGURE 3: Average reaction time to count down alphabet (adapted from 

Olshavsky, 1965, Fig. 2). 
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alphabet at a constant rate (he is silent during the interval between question and 
answer, so may proceed any way he wants). That is, we model our subject as a 
simple serial processing system which has operations of "get next," "add 1," 

"test if tally = n" and "speak result," along with some control structure for 
Integrating the performance into a repetitive loop. The linearity arises because 
the same operations are being performed repetitively. 

This particular figure, taken from a Masters thesis at CMU (Olshavsky, 1965), 
is not an isolated example. It shows several things that characterize much of 
the experimental work on the immediate processor. First, the task is very simple, 
thus illustrating the earlier point that information processing systems, not 
artificial intelligence systems should be our main concern. Second, the response 
measure is reaction time, so that the task is to infer the structure of a complex 
process from the time it takes to perform It. Third, a population of tasks is 
used, so that some gross aspect, such as the linearity in Figure 3, contains the 
essential induction from data to mechanism. Since, in fact, reaction times are 
highly variable, it is this last feature (initiated by Neisser, 1963) which 
distinguishes current work from a long history of earlier work on reaction times 
that didn't bear such fruit. 

Figure 4, from a study by Sternberg (1967), reinforces these points. He gave 
his subject a set of digits, say (1, 3, 7), and then asked them if a specified 
digit, say 8, was a member of the set. He finds, as the figure shows, that not 
only does it take longer to answer the question for larger sets, but the relation- 
ship is linear. Thus, again, the natural interpretation is according to a process- 
ing system engaged In repetitive search. (Though the search here is through Immediate 
memory, whereas it was through long term memory in Figure 3.) Now the point of 
showing this second example is that Sternberg goes on to use this basic result 
in an Ingenious way. In one condition he presents the subject with a fuzzy, 
degraded image. What should happen? 

We know. Independently, that It takes longer to compare a degraded image than 
* clear one to a known digit. One possibility is that the subject works with the 
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Image, thus having to make the more difficult comparison at each step of the 
search. If this were the case, the slope of the data line should be greater for 
the fuzzy image than for the clear image. A second possibility is that the 
subject Initially identifies which digit the fuzzy image represents and then com- 
pares an internal representation on each stage of the search. In this case, the 
slope should be the same, but there should be extra time for initialization. As 
Figure 4 shows, the latter clearly prevails. Thus we can infer that the operation 
of perceptual identification occurs prior to the search in immediate memory. 

The point of this study, for us, is to see how definitely Sterberg is working 
with a processing model. The situation is so simple that the key properties can 
be inferred without creating a program to simulate the subject. But the dependence 
on the detailed theory is no less for that. 

I will present you one more example, since I really wish to convince you of 
the extent to which information processing theories are taking hold at the level 
of studying the Immediate processor. This is work done by Donald Dansereau in a 
Ph.D. thesis just completed at Carnegie-Mellon (Dansereau, 1969). He studied the 
process of mental multiplication, e.g., "Multiply 27 by 132 in your head and when 
you are through, give the answer." His subjects were all highly practiced; even 
so t it takes a substantial length of time--e.g., about 130 seconds for 27x132. 

Again, as with these other studies, time was the measure, and he gave his subjects 
a large population of tasks. 

Now the fundamental fact about mental multiplication is that any crude 
processing model does quite well. That is, a reasonable count of the steps required 
by the method the subject uses (e.g., 62x943 requires 5 holds for the given digits, 

6 single-digit multiplications, 9 additions, 4 carries and 11 holds for a total 
difficulty factor of 35) does quite well in predicting the time taken. Figure 5 
shows actual times taken versus this difficulty factor for a particular subject. 

The linear regression accounts for about 90% of the variance. However, this result 
Is not at all sensitive to the exact assumptions. Other work has gotten similar 
results with quite different measures (Thomas, 1963), though in all cases they are 
crude processing models. 
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Oansereau went on to construct a more refined model. In which he postulated 
several kinds of memories with associated transfer times between and within 
memories. There was an Image store, where operands had to be positioned, as in a 
template, in order to be added or multiplied. There was a short term memory that 
held a small number of digits, e.g. , the definition of the problem or intermediate 
results. Finally, there was a long term memory in which information could be 
fixated for an Indefinite period of time. The transfer times are shown in 
Figure 6. They were obtained from Independent experiments, either already in the 
literature or done by Oansereau. Thus, these times are not parameters to be 
estimated from the primary data on performance. 

Figure 7 shows the results of this model. The system is complex enough to 
require simulation. The times taken by the simulation are shown as open circles 
and the actual times by the solid circles. Both are plotted against the difficulty 
factor used in the prior figure. (Thus there are many dots for a given difficulty 
factor, since there are many different multiplication problems with the same factor.) 
It can be seen clearly that the simulation has provided a next order of improvement, 
fitting the "staircase" effect of the actual data. This fit is not due to an 
excess of parameters, since the only parameter used to fit the data was a scale 
change. All others, as. remarked above, were estimated independently from other 
data. 

Although we have no space to discuss it, the model shows that very little 
time Is spent In the act of multiplying or adding. Rather, significant amounts of 
time are spent In memorizing intermediate results (which we expected) and in 
positioning operands (which we did not expect). 

This work shows clearly the shift from models of memory to models of the 
immediate processor. Memory, of course, remains central to the system, but there 
is much more as well. Furthermore, we have moved to where an explicit theory must 
be built of the situation (the simulation), even though the task is still not one 
that artificial intelligence finds of much Interest per se. 
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6. POINT THREE: ON BEING SERIOUS 

I have tried to illustrate with examples from one area. Immediate memory, that 
theories of man as an information processor are being used in serious and detailed 
ways. I would now like to turn this conclusion around. There have always been two 
feelings held by workers In artificial intelligence about themselves: (1) they 

were proceeding independently of any concern with human behavior (i.e., not 
simulating); alternatively (2), they were in fact being relevant to how man thinks. 
Both these views are, in my mind, legitimate — including their conjunction, which 
has been my personal position In some of our work (e.g., GPS). 

I wish to address myself to those of the second (simulating) persuasion. By 
now, anyone who is serious about the psychological relevance of his work in 
artificial intelligence had better be prepared to deal with detailed data of humans 
in specific situations, experimental or otherwise. As we discussed in connection 
with Figure 1, there are many ways in which a work in artificial intelligence could 
be considered relevant to the study of human behavior. All these ways remain 
legitimate. But the gradual success of the detailed, use of information processing 
theories means that none of the less demanding ways carry much punch (though there 
will always be exceptions, naturally). 

This same point was reached some years ago with respect to neural modeling and 
physiology. No neural modeling is of much interest anymore, unless it faces the 
detail of real physiological data. The novelty and difficulty of the tasks under- 
taken by heuristic programming has tended to push the corresponding day of reckoning 
off by a few years. The development of symbolic systems that would behave In any 
way Intelligently produced sufficiency analyses that were in fact relevant to the 
psychology of thinking. But the automatic relevance of such efforts seems to me 
about past. 

Let me illustrate this point briefly. In the last few years Ross Quillian 
has developed a model of semantic memory (Quillian, 1965, 1969). Many of you are 
undoubtedly aware of it; Bob Simmons discussed it to some extent in his paper at 
this conference. The essential features are (1) each concept is a node in a 
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semantic net (as in several other programs, such as SIR (Raphael, 1964); and 
(2) a complex structure encodes the definition (as in dictionary definition) of 
the word, thus relating it to the other concepts used in its definition. In his 
original work he used the task of giving the system two words, e.g., FIRE and BURN, 
and having it state the relationship between these concepts; e.g., FIRE IS CONDITION 
WHICH BURN, also TO BURN CAN BE TO DESTROY SOMETHING ON FIRE. 

Now this program is an example of sufficiency analysis, as we have used the 
phrase. For the system is not Intended as a detailed model of human memory and It 
was never tested as such. But it is relevant to psychology, because he was able 
to make (and demonstrate via the living program) conceptual progress in how human 
memory might be structured for tasks where we understand by general experience what 
performance can typically be expected of humans. Indeed, the work was a Ph.D. 
dissertation in Psychology at Carnegi e-Mel Ion (Quillian, 1966). 

There Is a sequel to this work — and It makes my point. Quillian is, indeed, 
interested in the psychology of human memory. Thus, he followed up this work In 
sufficiency analysis with an attempt to explore whether human memory could be 
modeled by such a structure (Collins and Quillian, 1969). The essential feature 
of a semantic net is that Information about a concept Is not all localized at the 
node corresponding to that concept, but Is distributed through the network. Thus, 
that a canary can sing, might be located at canary , but that a canary can fly is 
probably not located at canary , but at bird , since it is a property of all birds. 
Similarly, that a canary has skin is probably not even located at bird , but rather 
an animal . If this were the case, then It should take longer for such a system to 
answer yes or n£ to such questions (when embedded in a population of other questions, 
such as "Does a house sing?", "Does a cat fly," etc.). Further, if the net is 
homogeneous in Its structure, then there should be a constant operation time to go 
from node to node in the net. 

Figure 8 shows the results of asking these questions experimentally of humans, 
using reactions times. The points are averages over populations of questions of 
similar type. The quantity of interest is the difference between points, as 
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indicated above. Three of them are essentially Identical at the 80 ms. The fourth, 
from "A canary is a canary" to "A canary- is a bird" is too large, due, it appears, 
to the reaction to the former being too fast. But this is the one question that 
admits of an answer by perceptual matching, avoiding the meaning of the word 
"canary" altogether (hence the time to go from image node in the net). There Is 
other evidence in the literature that Indicates the same phenomena, e.g.. It takes 
longer to recognize that A and a. are the same than that A and A are the same, since 
(apparently) the latter can be done by a perceptual match and the other not 
(Posner, 1968). 

Before leaving Quillian's work, let me note that a number of assumptions are 
embedded in Figure 8. Thus, nothing guarantees that the particular words are 
related as the experimental analysis assumes, even if the structure of memory is 
precisely of this postulated type, i.e., the information that a canary can fly 
could be as close to "canary" as that It could sing. Thus, Collins and Quillian 
attempted to select a population of words that had a high prior plausibility of 
being this way -- and were successful, as you can see. 

To make the point of this work for us explicit: (1) if you want to assert 

the relevance of a theoretical information structure to psychology, then you had 
better build a bridge from it making use of experimental data; (2) it can be done. 
The bridge built by Collins and Quillian is a frail one, of course, since It only 
addresses one tiny aspect of the total memory system and it is compatible with 
many other similar structures. Indeed, as you would expect, neither Quillian nor 
anyone else thought the original structure was right in detail, and his second 
iteration Is a much modified system (Quillian, 1969). But with the experimental 
work, it has passed well beyond metaphor. 

Even more briefly, let me present one more example. This is taken from my own 
work with H.A. Simon on problem solving. I Insert it, both because I'm always 
inclined to mention some of my favori ty psychology, especially when it fits the 
story so well, and because none of our examples, with the possible exception of 
Quillian's memory structure, are from artificial intelligence. 
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We work Intensively with cryptarlthmetlc tasks, such as the SEND+M0RE=M0NEY 
that Flkes used to describe the operation of his program, REF-ARF In this confer- 
ence. It Is a puzzle that still retains modest challenges as a task for various 
problem solving programs. We could simply work with programs for solving this 
task, say like REF-ARF, or even some that are more like we Imagine a human proces- 
sor to be constructed, e.g., with a short term memory, a long term memory, etc. 

From these we could draw various conclusions about the general character of human 
problem solving. In fact, we did exactly this with the Logic Theorist, our 
original program (Newell, Shaw and Simon, 1958). But for some time now, since the 
first work with GPS (Newell, Shaw and Simon, 1960, 1961), we have taken an attitude 
much as I am trying to prescribe here. 

Thus, our typical operation Is to present a subject with the task, asking him 
to talk aloud as he works on It. A fragment of the result, called a protocol, Is 
shown in Figure 9, where the task is DONALD+GERALD=ROBERT and D=5 is given as 
initial Information (Newell, 1966, 1967). We then attempt to construct a process- 
ing system that mirrors the behavior of the subject and agrees with what we know 
about human. processing capabilities. 

Typical collegiate subjects In this task can be described wi th excellent 
fidelity as working in a problem space , whose elements are the states of knowledge 
the subject can have about the task, and whose structure is given by a small set 
of operators that work on a given state of knowledge to produce a new state of 
knowledge. Problem solving Is search through this problem space. This search, 
what we call the problem behavior graph (PBG), is shown in Figure 10 for the 
fragment of protocol shown in Figure 9. The four operators used by this subject 
are Process a column (PC), Assign a digit to a letter (AV), Generate the possible 
values for a letter (GH), and Test If a digit - letter assignment is legal (TO). 
(Actually, these are four specific variants of processes that meet these four 
general functional descriptions.) 

The problem space, with its operators, is a reasonable description only if 
'there exists an information processing system that describes the way the subject's 
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Subject 3 Problem DONALD D=5 (adapted from Newell, 1967). 
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search goes in this problem space. It is our fashion to write these- programs as 
production systems , meaning an ordered set of condition-action expressions with a 
control structure that continually executes the first expression whose condition is 
true. (The actual expression being executed changes as the action parts continually 
modify the Inmedlate memory, which is what the conditions test.) Again, there is 
no space to describe a complete production system, for our subject. A typical 
production, available in almost all subjects can be paraphrased, "If a new equality 
expression has just occurred, then find a column that contains the letter involved 
and process .that column." In symbols: 

<letter> * <digit> new -*• Find-column(<letter>); Process-column (column). 

The left side is written as a BNF grammar so that any expression in the knowledge 
state of the right form will trigger the production (e.g., 'T = 0 new’). 

Given a production system (the one for the subject of Figure 9 and 14 pro- 
ductions), one can attempt an accounting of the nodes of the problem behavior graph 
that are described successfully by a production. Figure 11 shows this accounting 
for the total protocol, in which the productions are ordered in terms of decreasing 
marginal utility. There are two kinds of errors. First, a certain fraction of the 
nodes in the problem behavior graph are not covered (errors of omission); this 
gradually drops to 11% with the total set of productions. Second, the wrong pro- 
duction gets evoked at a node in the graph, due to the ordering (errors of commission); 
these gradually rise as the set of productions increases to about 9% or 14% (two 
levels of error were counted in the original work). 

This brief description is not meant to do more than indicate a scientific 
style, namely, one where programs (the production systems) are constructed to deal 
with specific Instances of human data, with quantitative assessment of the 
comparison. Not revealed in this brief account is the concern for understanding 
how the programs of different subjects compare, so that a general theory of human 
problem solving emerges. The example should reinforce the story told by the 
earlier ones -- that if one is serious about the implications of work in artificial 
Intelligence for psychology, then one must come to terms with data on human 
behavior. 
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F-38 



397 


7. FINAL POINT: ON PSYCHOLOGY'S PREFERENCES 

The main points are now made, in response to the letter from my psychologist 
friend. First, the frame of reference must be expanded from artificial intelli- 
gence to information processing systems on symbolic structures. Then, it is found 
that a very substantial penetration has occurred of Information processing theories 
Into psychology. Second, there Is beginning a shift in experimental psychology 
from a concern exclusively with immediate memory to a concern with the whole of the 
Imnediate processor. This testifies to the use of information processing theories 
at a detailed level, and not just as experiment-guiding metaphor. Third, the 
coupling has become intimate enough that artificial intelligencers must take 
experimental data seriously if they want to claim any direct relevance of their 
work to psychology. 

With these points made, I should rest content. But there is one place where 
I have finessed ny letter-writing friend, and I must give him his due. For he 
did mean artificial intelligence and not information processing psychology In 
general (at least, so I believe). Now, whereas I would insist on the latter view, 
there. is one respect in which his concern remains justified. The higher mental 
processes in American psychology have always held a secondary place. There are 

many reasons for this: the impact of behaviorism that simply denied the relevance 

of the mental; the feeling that the important thing was the elements (i.e., the 
basic act of learning), out of which complexity would grow automatically; the sore- 
what fanatical adherence to the scientific canon of simplicity first ; the actual 
messiness of the area, as revealed by the relatively few, but continual, forays 

that have occurred. The reasons make little difference. The psychology of thinking 

and problem solving makes up a rather sad chapter in the history of psychology. 

It Is one of ny fond hopes that the situation has finally changed — that we 
have the techniques to deal with integrated goal oriented behavior on a par with 
any other part of psychology. But I am afraid that my correspondent is correct: 
that psychologists will not shift in mass to the study of thinking and problem 
solving. These areas will remain relatively lightly populated — and this was in 
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part what he was decrying, for he expected it to be different some ten years after 
the initial work. 

I think he is correct in his depression, since, as we have seen, there is large 
contact between information processing theories and psychology at the level of 
immediate memory (and psycholinguists too, though I didn't stress It as much). 

These are extremely exciting areas at the moment, as noted earlier. They can be 
attacked with relatively small amounts of theory and large amounts of sophisticated 
experimental techniques. They fit psychology's image of the proper thing to study: 
basic structure and not too much complexity. Thus, as experimental psychologists 
move towards assimilating information processing theories, they will gravitate 
towards the study of the immediate processor and basic language structure, and not 
toward thinking and complex problem solving. It is not without significance that 
only one out of five of my examples came from the heartland of artificial intelli- 
gence. 
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